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Abstract 

The  relationship  between  the  structure  of  line  codes  and  the  error 
probability  for  baseband  PCM  systems  with  intersymbol  interference  is  studied 
for  a broad  class  of  finite-state-machine  codes.  The  model  for  a repeatered 
transmission  line  is  reduced  to  a digital  model  including  the  line  encoder, 
linear  dispersive  digital  channel,  additive  Gaussian  noise,  and  a fixed 
threshold  detector.  The  error  probability  for  a first  order  channel  model 
(exponential  response)  is  then  found  numerically  using  the  Gram-Charlier  series 
representation  for  the  Gaussian  noise  probability  density  function.  In  order  to 
accomplish  this  computation,  the  moments  of  the  intersymbol  interference  are 
derived  for  this  channel  model  via  a property  of  the  encoded  channel  sequence 
statistics. 

The  error  probability  results  are  used  to  show  that  a simple  approximation 
for  channels  which  cause  appreciable  intersymbol  interference  over  a span  of 
many  symbols  (for  example,  more  than  fifty)  Drovides  accurate  results.  This 
approximation  demonstrates  the  multimodal  nature  of  the  probability  density  of 
intersymbol  interference,  and  suggests  a new  method  of  Decision  Feedback 
Equalization  which  removes  most  of  the  deleterious  effects  of  the  intersymbol 
interference  on  the  error  probability.  It  is  also  shown  that  the  error 
propagation  which  is  inherent  in  Decision  Feedback  Eaualizers  can  be  effectively 
controlled  with  this  new  equalizer. 
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intersymbol  interference  is  also  discussed.  A method  for  comparing  the  error 
performance  of  different  codes  is  proposed  for  the  case  when  the  decision  device 
in  the  system  is  a fixed  threshold  detector.  This  method  is  based  on  the 
multimodal  nature  of  the  intersymbol  interference,  and  is  directly  related  to 
the  error  probability.  The  analysis  of  the  error  performance  for  different 
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codes  provides  insight  into  the  relationship  between  the  statistics  of  the 
channel  sequence  generated  by  the  encoder  and  the  error  probability  at  the 
threshold  detector,  and  the  results  suggest  code  construction  techniques  which 
minimize  the  error  probability.  This  is  an  improvement  over  the  current 
techniques  which  are  primarily  heuristic,  and  consequently,  do  not  relate  the 
code  structure  directly  to  the  error  probability.  An  example  of  a code  desistn 
procedure  is  given  for  this  case.  In  addition,  an  example  of  a code  design 
procedure  is  given  for  the  Decision  Feedback  Equalizer  mentioned  previously. 

Finally,  the  problems  associated  with  decoding  finite-state-machine  block 
codes  are  discussed,  and  methods  for  detecting  transmission  errors  at  the 
destination  terminal  are  considered.  The  existence  of  single  error  correcting 
finite-state-machine  codes  with  moderate  redundancy  is  also  demonstrated. 
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CHAPTER  1 
INTRODUCTION 


1 . 1 Preview  of  Results 

In  practical  pulse-code  modulation  (PCM)  data  transmission  systems,  some 
form  of  coding  is  used  before  the  data  are  transmitted  through  the  channel. 
These  codes,  which  are  called  line  codes,  provide  redundancy  for  spectrum 
shaping,  error  control,  and  symbol  synchronization  in  the  system.  The  primary 
characteristic  of  the  random  sequences  which  are  generated  by  line  encoders  is 
the  correlation  between  the  channel  symbols.  This  correlation  ha3  a great 
effect  on  the  distortion,  called  intersymbol  interference,  which  is  a result  of 
the  bandlimited  nature  of  the  channel.  In  this  thesis,  we  study  the 
relationship  between  the  line  code  structure  and  the  intersymbol  interference. 

Moreover,  we  study  the  effects  of  this  intersymbol  interference  on  the 

% 

probability  of  error  when  the  channel  is  also  disturbed  by  additive  Gaussian 
noise. 

The  results  of  this  study  differ  from  those  currently  available  in  that  we 
focus  directly  on  the  statistics  of  the  intersymbol  interference  which  result 
from  the  correlated  sequence  generated  by  the  encoder  being  passed  through  the 
linear  dispersive  channel.  In  particular,  we  demonstrate  a method  for 
calculating  the  error  probability  for  a channel  with  an  exponential  response. 
Our  approach  does  not  follow  the  current  practice  of  truncating  the  impulse 
response  of  the  channel  to  a finite  number  of  symbols  in  order  to  calculate  the 
probability  of  error.  The  relaxation  of  this  assumption  allows  us  to 
concentrate  on  the  intersymbol  interference  statistics  for  channels  which 
exhibit  an  appreciable  response  over  a span  of  many  symbols.  This  leads  to  a 
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simple  model  for  channels  with  a slow  response.  Furthermore,  we  will  see  that 
the  probability  density  of  the  intersymbol  interference  in  this  situation 
consists  of  a finite  number  of  impulses.  We  will  find  that  the  multimodal 
nature  of  the  intersymbol  interference  is  related  to  a parameter  of  the  channel 
sequence,  known  as  the  digital  sum.  This  result  is  used  to  demonstrate  a 
decision  feedback  equalizer  which  effectively  removes  the  intersymbol 
interference.  It  is  shown  that  the  error  propagation  which  is  inherent  to 
decision  feedback  can  be  effectively  controlled  in  this  new  equalizer. 
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The  insight  gained  from  this  channel  model  also  provides  a method  of 
comparing  the  error  performance  of  different  line  codes.  This  is  an  improvement 
over  current  techniques  which  rely  on  heuristic  parameters  or  Simulation.  7n 
addition,  we  are  able  to  construct  codes  based  directly  on  the  minimization  of 
the  error  probability.  This  is  more  desirable  than  the  current  practice  of 
using  heuristic  ideas  for  the  construction  and  then  checking  the  results  with 
simulation  or  calculations. 

The  structure  of  the  line  codes  we  consider  also  allows  for  the  detection 
of  transmission  errors  at  the  receiving  terminal.  A discussion  of  techniques  to 
accomplish  this  is  provided  in  this  thesis.  Furthermore,  we  demonstrate  the 
existence  of  ternary  line  codes  which  are  capable  of  correcting  single  errors 
while  requiring  only  moderate  redundancy. 

1 .2  Pertinent  Background  and.  Literature 

PCM  signals  contain  frequency  components  from  dc  to  an  upper  limit  which 
depends  primarily  on  the  rate  of  transmission,  and  therefore,  PCM  systems  are 
called  baseband  systems.  All  physical  channels  are  bandliraited,  and 
consequently  they  distort  any  wideband  signal  which  is  transmitted  through  them. 


When  digital  PCM  signals  are  transmitted  through  a oandlimited  medium,  the 
distortion  at  every  instant  of  time  is  not  of  fundamental  importance  because  the 
digital  data  is  recovered  by  sampling  the  received  signal  at  regular  intervals. 
The  distortion  at  each  sampling  time,  therefore,  is  the  primary  consideration. 
Nyquist's  classic  result  [1]  concerning  this  distortion  provided  the  link 
between  the  speed  with  which  digital  data  may  be  transmitted  and  the  bandwidth 
of  the  medium;  namely,  when  the  symbols  are  transmitted  at  the  rate  of  1/T 
symbols/sec,  the  minimum  bandwidth  of  the  medium  for  distortionless  reception  is 
1/2T  hz.  Furthermore,  the  frequency  characteristic  of  this  minimum  bandwidth 
medium  must  be  rectangular,  which  is  physically  unrealizable. 

Research  on  data  transmission  systems  has  produced  many  theoretical  results 
pertaining  to  methods  for  designing  media  which  approximate  the  behavior  of  the 
distortionless  baseband  channel.  These  normally  require  a greater  bandwidth 
than  the  theoretical  minimum  because  of  the  unrealizability  problem,  and  because 
of  the  methods  used  for  sampling  the  analog  signal  (symbol  synchronization).  A 
general  introduction  to  these  techniques  and  results  is  provided  in  [2]  and  [3], 
and  a collection  of  important  papers  in  this  area  has  been  edited  by  Franks[4], 
An  extensive  bibliography  and  survey  may  be  found  in  [5]. 

A fundamental  consideration  in  any  communication  system  is  interference 
from  noise  which  is  external  to  and  independent  of  the  information  signal.  The 
standard  technique  of  analysis  is  the  modeling  of  this  noise  as  an  uncorrelated 
Gaussian  random  process.  The  most  natural  and  widely  accepted  criterion  for  the 
performance  is  the  probability  that  a digital  symbol  will  be  received 
incorrectly  (the  probability  of  error).  For  a distortionless  channel,  the  error 
probability  is  a function  of  the  signal-to-noise  ratio  (SNR)  only.  However, 
when  there  is  distortion,  the  error  probability  also  depends  on  the  statistics 
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of  the  distortion.  These  statistics  in  turn  depend  on  the  impulse  response  of 
the  channel  and  the  statistics  of  the  digital  signal.  The  most  common 
assumptions  made  about  these  two  factors  are  that  the  impulse  response  of  the 
channel  is  nonzero  for  a finite  length  of  time  only,  and  the  data  symbols  which 
are  transmitted  over  the  channel  are  statistically  independent  binary  random 
variables.  The  latter  assumption  restricts  the  control  of  the  intersymbol 
interference  to  the  design  of  the  medium  (which  includes  pulse  shaping, 
equalization,  etc.).  The  results  of  these  analyses  provide  various  optimal 
pulse  shaping,  equalization  and  detection  schemes.  These  results,  however,  are 
optimal  with  respect  to  criteria  other  than  the  error  probability  (mean-squared 
error,  peak  distortion)  because  the  direct  analysis  of  error  probability  has  not 
yielded  insight  on  how  the  intersymbol  interference  statistics  affect  the  error 
probability. 

The  assumption  of  statistically  independent  binary  random  variables  has 
another  weakness.  In  the  design  of  actual  systems,  some  form  of  coding  is  used 
before  the  digital  signal  is  transmitted.  That  is,  a sequence  of  statistically 
independent  binary  random  variables  is  transformed  into  another  sequence  of 
random  variables,  which  is  usually  correlated.  Moreover,  the  channel  sequence 
is  often  multilevel  (in  particualar,  ternary)  because  less  reduction  in  the 
symbol  transmission  rate  is  necessary  (or  an  increase  in  rate  is  possible),  and 
the  necessary  increase  in  the  signal  power  to  overcome  the  reduced  noise  margin 
is  not  a problem.  Techniques  have  been  developed  to  calculate  the  error 
probability  in  these  situations;  however,  the  nature  of  the  intersymbol 
interference  is  not  apparent,  and  the  relationship  between  the  coding  method  and 
the  error  probability  is  not  clear. 
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Two  survey  papers  on  coding  techniques  for  baseband  systems  have  been 
authored  by  Kobayashi[6]  and  CroisierL7].  These  cover  the  majority  of  codes 
which  have  found  practical  use,  including  the  correlative-level  encoding  method 
(also  known  as  partial  response),  and  sequence-state  methods. 

1 .3  Outline  &£  iHs.  Thesis 

In  Chapter  2,  the  appropriate  models  for  the  PCM  transmission  line  with 
regenerative  repeaters  are  presented  and  the  problem  of  determining  the 
statistics  of  the  intersyrnbol  interference  is  discussed.  We  also  introduce  a 
finite-state-machine  model  for  a general  class  of  codes  which  includes  all  known 
codes  of  practical  interest.  A method  for  determining  the  necessary  statistics 
for  the  intersyrnbol  interference  for  this  class  of  codes  is  shown,  and  some 
assumptions  about  these  statistics  are  discussed.  An  important  parameter  of  the 
encoded  sequence,  the  digital  sum,  is  also  introduced. 

In  Chapter  3,  we  derive  the  expressions  for  the  error  probability  and  apply 
then  to  a first  order  model  of  intersyrnbol  interference.  This  model  is  a 
departure  from  the  usual  assumption  of  a finite  duration  channel  response.  In 
fact,  the  focus  of  our  development  is  the  nature  of  the  intersyrnbol  interference 
statistics  when  the  effective  channel  response  is  longer  than,  say,  50  symbols. 
We  illustrate  the  error  probability  results  for  several  known  codes  under 
various  channel  conditions.  A study  of  the  limiting  case  of  the  first  order 
channel  as  the  response  duration  (or  memory)  approaches  infinity  is  presented  in 
Chapter  4.  We  will  find  that  the  intersyrnbol  interference  possesses  a 
multimodal  probability  density  for  channels  with  a slow  response.  We  show  that 
assuming  the  probability  density  of  the  intersyrnbol  interference  as  a sum  of 
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Some  simulation  results  we  present  also  provide  evidence  for  this  assumption. 
Finally,  a decision  feedback  equalization  scheme  which  removes  the  multimodal 
nature  of  the  intersymbol  interference  is  discussed. 

We  consider  the  problem  of  synthesizing  codes  for  the  control  of 
intersymbol  interference  in  Chapter  5.  After  a preliminary  discussion  on  the 
general  constraints  for  the  synthesis  of  this  class  of  codes,  we  develop  some 
ideas  for  designing  codes  to  minimize  the  error  probability  in  the  presence  of 
the  multimodal  intersymbol  interference,  and  show  an  example  of  a construction. 
We  also  show  an  example  of  code  construction  for  a system  with  the  decision 
feedback  equalization  scheme  introduced  in  Chapter  4. 

We  conclude  this  thesis  with  some  considerations  of  the  decoding  procedure 
for  these  codes  in  Chapter  6.  We  also  discuss  error  detection  schemes  for  the 
decoder.  Finally,  a known  construction  of  ternary  error  correcting  codes  is 
used  to  demonstrate  the  existence  of  finite  state  machine  codes  which  correct 
single  errors  but  require  only  moderate  redundancy. 


CHAPTER  2 


BASEBAND  DATA  TRANSMISSION  SYSTEMS 


2.1  System  Model 

Since  we  are  interested  in  analyzing  the  performance  of  encoding  techniques 
for  baseband  systems,  it  is  worthwhile  to  begin  with  a discussion  of  the  overall 
system  model.  Fig.  1 illustrates  the  gross  model  for  a baseband  system  with 
regenerative  repeaters. 


repeaters 


Figure  1 . Overall  Model  of  Repeatered  Line 

The  source  terminal  emits  the  information  signal  in  a form  which  is  suitable  for 
the  transmission  medium  which  is  typically  a twisted-pair  or  coaxial  cable. 


Fig.  2 illustrates  an  appropriate  model. 


Figure  2 . Source  Terminal 

In  this  analysis,  we  shall  assume  that  the  information  sequence,  bn,  is 
binary.  The  line  encoder,  whose  structure  is  left  unspecified  for  the  time 
being,  simply  translates  the  binary  sequence  into  a stream  of  impulse  functions 

"1Ch  “ J 

" 

x(t)=  ^ak5(t-kT),  (2.1) 

k»  -« 

where  a^  is  an  L-ary  symbol^,  and  1/T  is  the  data  rate.  The  sequence,  {a^}, 
shall  be  called  the  channel  sequence.  The  signal,  x( t) , then  passes  through  the 

| 

linear  system,  G(  w ) to  produce  the  modulated  signal 


<3 


I 


s(t)=  l akg(t-kT), 

k*  _ oo 


(2.2) 


which  is  a repetitive  sequence  of  translated  causal  functions,  g(t),  having  the 


(1) 


The  elements  of  the  L-ary  alphabet,  A , shall  be  considered  as  numbers  such 


that  A ={  |i  |$  L -2}  for  L odd,  andA={±(2£  -1  )/2:i  = 1 , . . . ,L/2 } for  L even. 


I 


I 


0 


D 


multiplier  for  t 2-kT.  The  modulated  signal,  s(t),  is  transmitted  through  the 


cable  to  the  regenerative  repeater,  We  shall  model  the  transmission  medium  as  a 


linear  system  having  the  frequency  response,  C(  “ ) , and  correspondingly  its  time 


response,  c(t),  which  is  the  inverse  Fourier  transform  of  C(ui  ) 


A repeater  model  for  this  system  is  shown  in  Fig.  3«  In  this  illustration 
L(  w ) is  a linear  system  (including  amplification)  which  filters  the  incoming 


signal,  and  the  timing  circuit  extracts  information  from  the  received  signal  so 


that  the  sampler  is  synchronized  to  the  symbol  stream.  The  decision  element 


which  for  cost  reasons  is  usually  a threshold  slicer,  produces  an  estimate,  a 


of  the  symbol  a^.  at  every  time  k.  The  estimated  sequence,  a 


is  then  passed 


through  the  pulse  shaping  filter,  G(ui  ),  which  is  identical  to  that  in  Fig.  2 


The  signal  s(t)  is  now  transmitted  through  the  next  section  of  cable  until  it  is 


processed  by  the  next  repeater 


SAMPLER 


DECISION 


Figure  3.  Repeater  Model 


For  our  purposes,  it  is  convenient  to  consider  the  cascade  of  G(  tn  ) , C(  <*>  ) 


as  a single  element  in  the  model.  The 


and  the  repeater  front-end 


motivation  for  this  lies  in  our  interest  to  study  the  relationship  between  the 


line  encoder  and  the  estimated  symbol  sequence 


ht) 

G(w) 

10 


The  input  signal  to  the  sampler  is  also  corrupted  by  noise,  S (t),  which  we 
shall  model  as  fchite  Gaussian  with  zero  mean  and  variance  equal  to  Nq/2 . These 
considerations  allow  us  to  model  the  first  link  in  the  total  system  by  the  block 
diagran  in  Fig.  4. 


LINE 

ENCODER 


). 

Q AUDI  CD 

rk 

DECISION 

A 

•k 

r 

wMWIrLCn 

Figure  4.  Simplified  model  of  the  first  link 

Note  that  we  have  omitted  the  timing  circuit  from  the  simplified  model.  For 
this  analysis,  we  assume  that  the  sampler  operates  perfectly.  This  is  a major 
simplifying  assumption  since  timing  considerations  are  very  important  for  these 
systems.  Nevertheless,  this  assumption  allows  us  to  simplify  our  model  further. 
Frco  Fig.  4 we  see  that 


r(t)=/x(T  )h(t-x  )dx  + 5(t)=  l akh(t-kT)  +C(t). 

— * 00 

k»-  CO 


Thus,  after  the  sampling  operation  we  have 

oo 

r^srCkT+tg):  I ajh[  (k-j)T+tg]-*-  5(kT+tg) 

j a -oo 


(2.3) 


(2.4) 


0 1 
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where  tg  is  a parameter  in  the  timing  circuit.  Consequently,  we  can  consider 
the  system  in  Fig.  4 equivalent  to  a digital  system.  This  simplification  leads 
to  the  system  as  shown  in  Fig.  5, 


s 


Figure  5.  Equivalent  digital  model 

where  the  digital  system,  Hte^  ),  is  derived  from  the  cascade  of  G(  w ),  C(  oj), 
and  L(  oj  ) by  the  familiar  "folding"  operation. 

Whenever  a^Ja^,  we  say  that  a symbol  error  has  occurred;  furthermore,  the 
probability  of  occurrence  of  symbol  errors  is  the  probability  of  error.  P(E). 
We  shall  consider  P(E)  to  be  the  most  important  indicator  of  system  performance; 
hence  the  objective  of  our  analysis  is  to  determine  how  to  minimize  P(E). 

Errors  in  PAM/PCM  systems  are  attributable  to  various  causes.  The  linear 
channel,  H(eJw),  produces  a time  dispersed  sequence  whereby  the  symbol  ak  at  a 
given  time  affects  the  received  symbol  sequence,  rj,  for  j>k.  This  is 
intersymbol  interference  (I. I.).  In  addition,  the  Gaussian  noise  sequence,  , 
further  corrupts  the  dispersed  signal.  The  effect  of  the  combination  of  these 
elements  (or  either  one  singly)  may  be  sufficient  to  override  the  operation  of 
the  decision  element,  thereby  causing  an  error.  These  two  causes  of  error 
constitute  the  topic  of  this  thesis.  Other  causes  of  error  which  are  not 
considered  here  are  timing  errors,  cross-talk,  and  non-linearity  in  the  actual 
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system. 


2.2  Error  Channel 


The  equivalent  digital  system  allows  us  to  express  the  received  symbol  at 
time  k as 


rk=h0ak+I  hk_jaj+  £ k* 

jf  k 


(2.5) 


We  may  consider  13q=1  without  loss  of  generality  so  that  ric=ak+ik+^k  where 


ik=  E hk-jaj 


(2.6) 


is  the  I. I. 


For  the  purpose  of  analyzing  the  relationship  between  the  line  sequence  and 
the  I. I.,  it  is  convenient  to  substitute  an  equivalent  model  for  the  linear 
system,  H(eJw),  in  Fig.  5.  At  every  time  k,  the  desired  signal  at  the  detector 
is  ak,  and  the  interfering  signal  is  the  response  of  H(eJw  ) to  the  previous 
symbols.  We  can  represent  the  impulse  response  of  H(e^“  ) as 


0;  J<0 
i;  J=o 

J>o  . 


(2.7) 


We  can  now  view  H(e^  u ) as  the  parallel  arrangement  of  a feedforward  connection 
and  a linear  system  having  the  impulse  response 


■■n  ■ 
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0;  j<o 

hj;  J>0. 


(2.8) 


We  shall  refer  to  the  system  having  this  impulse  response  as  the  error  channel . 

By  means  of  this  model,  the  I. I.  and  desired  signal  are  separated  in  a 
concaptually  simple  manner,  as  illustrated  in  Fig.  6. 


Figure  6.  Error  Channel 

We  must  note  that  we  are  assuming  that  a channel  symbol  does  not  cause  I. I.  for 
any  past  symbols  (i.e.,  the  error  channel  is  causal),  although  in  a practical 
system,  this  may  not  be  the  case.  Generally,  the  nonlinear  phase  characteristic 
of  the  channel  causes  the  pulse  to  exhibit  a response  before  the  peak  of  the 
received  signal.  This  response  interferes  with  past  symbols  because  the  samples 
of  the  received  signal  are  taken  near  the  peak  of  the  response.  The  non-causal 
response  of  the  error  channel  arises  because  the  inherent  linear  delay  in  the 
system  is  not  a factor  in  the  analysis.  However,  in  such  a system  a symbol  will 
interfere  with  a finite  number  of  past  symbols  but  may  interfere  with  an 


*7T. 
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I 


I 
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infinite  number  of  future  symbols.  The  analysis  that  follows  will  concentrate 
on  the  latter  case. 


2.3  £mu:  Probability  Mathematical  Difficulties 

We  are  interested  in  calculating  P(E)  for  the  system  model  given  in  Fig.  5. 

A 

Wien  hjrO  for  all  j (i.e.,  there  is  no  I. I.),  the  calculation  is  straightforward 
and  we  obtain  the  familiar  dependence  of  P(E)  on  the  signal-to-noise  ratio.  The 
conditions  on  HCe^  ) which  ensure  hj=0  are  the  well  known  Nyquist  criteria. 

Wienever  the  I. I.  is  not  zero,  the  calculation  of  P(E)  is  complicated  by 
its  dependence  on  the  statistics  of  the  I. I.  If  the  probability  density 
function  (or  distribution  function)  were  easily  calculated,  it  would  be  a simple 
matter  to  calculate  P(E)  for  a given  system  by  convolving  the  Gaussian  density 
with  the  I. I.  density.  Unfortunately,  the  calculation  of  the  I. I.  probability 
distribution  is  quite  a difficult  problem.  In  fact,  very  little  can  be  said 
about  the  probability  distribution  in  general.  In  [8],  a survey/tutorial  of 
results  is  presented  on  the  problem.  If  the  error  channel  has  a finite 
response,  then  it  is  clear  that  the  probability  distribution  is  a staircase 
function,  since  the  I. I.  may  take  on  a finite  number  of  values  only.  However, 
when  the  error  channel  has  an  infinite  response,  the  distribution  may  or  may  not 
be  caitinuous,  depending  on  the  rate  of  decay  of  the  response.  Furthermore, 
even  if  the  distribution  is  continuous,  it  may  not  be  differentiable  (so  that 
the  density  may  not  exist).  Despite  closed-fonu  results  which  have  been  found 
for  some  special  cases,  the  problem  remains  open  and  the  possibility  of  simple 
results  seems  remote.  From  the  viewpoint  of  applications,  therefore,  numerical 
methods  for  finding  P(E)  appear  to  be  necessary. 


J 
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In  the  past  decade,  much  effort  has  been  expended  on  finding  upper/lower 
bounds  and  approximations  for  P(E),  and  the  available  results  provide  a myriad 
of  techniques  of  varying  complexity  and  accuracy.  A survey  of  the  pre-1975 
literature  in  this  area  is  given  in  [9].  Later  we  shall  utilize  one  such  method 
which  is  an  approximation  of  P(E)  based  on  an  infinite  series  representation  of 
the  joint  I.I.  and  noise  probability  distribution  (Gram-Charlier  series).  This 
method  requires  knowledge  of  the  moments  of  the  I.I.  only,  which  may  be  found 
recursively  without  an  analytical  expression  for  tne  distribution  function. 
Before  discussing  the  application  of  this  technique,  we  shall  present  a 
description  of  a broad  class  of  line  encoders  which  encompasses  most  situations 
of  practical  interest. 

2 . 4 Finite  £.tate  Machine  Lins.  Encoders 

In  the  description  of  the  overall  system  model,  we  did  not  indicate  the 
structure  of  the  line  encoder  (Fig.  2).  The  subsequent  discussion  of  the  P(E) 
will  show  that  the  statistics  of  the  I.I.  are  strongly  dependent  on  the 
statistics  of  the  channel  sequence.  A class  of  encoders  for  which  the 
statistics  may  be  evaluated  is  described  by  means  of  a finite  state  machine 
(FSM)  model.  Let  the  quintuple  {B  , L ,S  ,f,g)  denote  the  encoder  with 
8={0,1}m,  which  are  inputs  consisting  of  sequences  of  m binary  symbols  (input 
blocks),  LCAn,  outputs  consisting  of  sequences  of  n L-ary^  symbols  (output 
blocks  or  codewords),  and  S = { 1 ,2 , . . . , r}  , a finite  set  of  states  conveniently 
denoted  as  integers.  The  output  function,  f:8*S-*L  , can  be  described  as  a set 

^See  footnote  ( 1 ) . 
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of  functions  F={f1 ,f2, . . . ,fr) , where  f B+L 


The  mapping  f^  shall  be 


referred  to  as  the  encoding  mode  corresponding  to  state  i.  The  state  transition 
function,  g:BxS->-  S , determines  the  state  of  the  encoder  at  all  times  given 
that  the  initial  state  is  known. 


The  input  block  sequence  is  assumed  to  be  stationary  with  each  block 

statistically  independent.  For  each  bu;  u=1 , . . . ,2m=K,  its  associated 

probability  will  be  denoted  9U>  with  the  obvious  condition 
K 

I eu  =1- 
U=1 

The  encoded  process  (channel  process)  may  be  represented  as  in  Fig.  7 below. 


_(1)_(2) — _(n) | _( 1 )_(2 . 
a 0 a 0 3 0 I a1  ai 


i^Pa^- ' ’a^| 


Figure  7.  Channel  Sequence 


This  representation  shows  the  symbol  sequence  properly  framed  with  respect 
to  the  words  of  length  n.  The  symbol  ak  represents  the  1th  digit  of  the  kth 
codeword  in  the  channel  sequence.  We  will  refer  to  the  time  of  transition  from 
word  to  (i.e.,  the  symbol  transition  a^  to  a^)  as  wordtime  i.  Note 
that  the  sequence  representation  explicitly  shows  the  state  sequence  of  the 
encoder.  We  shall  also  adopt  the  convention  that  the  state  transition  from  si_1 
to  s^  occurs  at  wordtime  i. 
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At  wordtime  i,  a new  binary  blc  . is  input  to  the  encoder  and  the  state 
transition  function  changes  the  internal  state  of  the  encoder  according  to 
si=S(bi-l.si-i)(3)-  Assuming  that  the  binary  input  process  is  block  stationary 
with  statistically  independent  elements,  it  easy  to  show  that  the  state  sequence 
is  a homogeneous  Markov  chain  with  stationary  transition  probabilities  (see, 
e.g.,  [10]).  Let  be  the  set  of  input  words  which  cause  the  encoder  to 

change  from  state  i to  state  j (i.e.,  8ij={be8  |j=g(b,i)}),  and  denote  the 
one-step  transition  matrix  of  the  state  process  as  n=  [|  ir^j  [|  , then 

*ij  - l eu  (2.9) 

bueSij 

A restriction  on  n (hence,  on  the  state  transition  function,  g(b,s))  is  that 
the  state  process  must  be  a fully  regular  homogeneous  Markov  chain.  For  this 
class  of  state  transition  matrices,  there  exists  ^ =j^jj  II k where  !Ik  denotes 
the  matrix  of  k-step  transition  probabilities.  Furthermore,  n„  consist*’  of 
identical  rows  j2=(p^  ,P2»  . • • ,Pr)  such  that^II=£.  The  vector  £ is  called  the 
stationary  probability  distribution.  This  restriction  does  not  affect  the 
application  of  the  model  to  practical  situations. 


A subclass  of  the  FSM  encoders  which  has  useful  properties  results  from 
considering  a specific  form  for  the  state  transition  function,  g.  Let 
A=(a( 1 ^ ,a(2  ^ , . . . ,a(n^ ) be  a codeword  and  define  the  characteristic  of  .a,  y (a)  , 
as 


(^Here  and  hereafter,  s^  and  b^  refer  to  the  state  and  input  at  wordtirae  i 
respectively. 
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n 


i=1 


where  the  sum  is  real.  Furthermore,  we  assume  there  is  a function  a;  5 -+■  £ 
(Z  is  the  set  of  integers)  such  that  for  every  seS  , and  b e & , we  have 

o [g(b,s)]=  a(s)+  y[f(b,s)].  (2.10) 

Expression  (2.10)  means  that  the  image  of  the  next  state  under  a is  equal  to 
the  image  of  the  present  state  plus  the  characteristic  cf  the  present  codeword. 
Codes  with  this  property  have  been  called  counter-encodable  (CE)  [11]  because 
the  encoder  can  be  implemented  with  a digital  counter  which  contains  the  value 
a (s)  for  the  current  state  s. 


A simple  example  of  a ternary  CE  code  is  the  Paired  Selected  Ternary  (PST) 
[12]  which  is  described  by  the  codebook  in  Table  1. 


Table  1 . Codebook  for  PST 


u 

bu 

state  1 

state  2 

1 

00 

+- 

+- 

2 

01 

- + 

- + 

3 

10 

+0 

-0 

4 

1 1 

0+ 

0- 

We  note  from  Table  1 that  the  code  naps  blocks  of  2 binary  digits  (m=2)  into 
blocks  of  2 ternary  digits  (n=2),  and  that  there  are  two  encoding  modes 
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corresponding  to  the  two  states  (r=2).  We  shall  refer  to  an  FSM  code  as  an 
(m,n,r)  code  whereby  PST  is  a (2,2,2)  code.  Note  that  state  1 contains  two 
words  each  with  characteristic  equal  to  zero  (viz.  +-  and  -+)  and  two  words 
each  having  characteristic  equal  to  one  (viz.  +0  and  0+) . Likewise  in  state  2 
there  are  2 words  having  characteristic  equal  to  zero  and  2 words  having 
characteristic  equal  to  minus-one.  As  an  example  of  the  encoder  operation,  if 
at  wordtime  n,  the  encoder  is  in  state  1 and  the  binary  block  '10'  is  input,  the 
encoder  will  output  the  block  *+0'  and  change  to  state  2.  The  function  a in 
this  case  is  simply  c(i)=i;  i=1,2.  If  the  input  blocks  are  assumed  to  be 
equiprobable,  the  state  transition  matrix  is 


1/2 

1/2 

1/2 

1/2 

2 . 5 Statistics  lDtersyabQ.1  Interference  .fan  ESH  Calcs. 

The  previous  discussion  of  the  encoder  structure  has  assumed  that  the 
binary  input  process  is  block  stationary.  The  state  process,  therefore,  is  also 
stationary  since  s^ =g( b^ ,3^ , and  the  codeword  process  is  block  stationary 

i 

since  .a^rf  (b^s^) . However,  we  are  interested  in  the  channel  symbol  sequence 
statistics.  When  the  channel  sequence  is  produced  by  a block  encoder,  the 
strean  of  weighted  impulses  which  is  transmitted  through  the  channel  (see  (2.1)) 
may  be  represented  as 

" n 

x(t)=  l l a^  <S[ t-kn- (1-1 ) J (2.11) 

k=-«i=1 


‘ ■ 
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where  the  index  k indicates  the  codeword  position  in  the  sequence,  and  the  index 
1 indicates  the  codeword  digit  (phase).  The  process  (2.11)  is  cyclostationary 
with  period  n because  the  codeword  sequence  is  stationary. 

When  the  discrete  process  (2.11)  is  applied  to  the  error  channel,  the 
resulting  I. I.  will  be  cyclostationary  with  period  n in  the  steady  state.  Thus 
we  can  view  the  I. I.  as  block  (or  vector)  stationary.  This  allows  us  to  derive 
the  statistics  of  the  I. I.  by  considering  the  statistics  of  a single  block. 

Let  i^ , i^ i^)  denote  the  vector  which  corresponds  to  the 

kth  codeword,  ,ak  (i.e.,  each  component  of  the  vector  interferes  with  the 
corresponding  symbol  in  the  codeword).  The  joint  probability  distribution  for 
the  1. 1.,  denoted  by  Fj(  a )=Prob(ilc < a.) » is  a function  of  the  statistics  of  the 
set  of  possible  codeword  sequences,  A=gk.gk_.|  f . . . f which  we  denote  by  A . When 
the  encoder  is  an  F34  we  may  conveniently  partition  the  set  A into  r subsets.  1 

A . , which  s re  defined  as 

a j 

Ai=  { A i sk+i=i}-  (2*12) 


The  event  Ai  is,  therefore,  identical  to  the  event  s^si,  and  the 


probability  distribution  function  of  the  I.I.  is 


i 

I 

j 

I 

I 


r 

Fi(a  )=  l Fj(  a|  sk+1=i)P(sk+1=i) , 
i=1 


(2.13) 


which  is  well  defined  since  the  state  process  possesses  a steady  state 
distribution.  The  probability  distribution  may  be  calculated  with  a finite  sum 
as  a result  of  the  encoder  structure. 
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Of  course,  (2.13)  does  not  indicate  how  to  find  the  conditional  probability 
distributions  Fj(a|  s=i).  Rather  than  pursue  this  line  of  analysis,  we  shall 
apply  this  simplification  to  the  case  of  CE  codes  in  the  following  section. 


2.6  piK.Ua],  Sum  Process  and  ££  codes 


Thus  far  we  have  discussed  the  channel  sequence  and  the  I. I.  sequence  as 
stationary  vector  processes.  In  the  discussion  of  the  system  model,  it  was 
noted  that  the  decision  element  in  a regenerative  repeater  is  usually  a simple 
symbol-by-symbol  threshold  detector.  Therefore  we  need  to  develop  the 
statistics  for  the  symbol  process  rather  than  the  vector  process.  We  have  noted 
in  the  previous  section  that  the  joint  distribution  of  the  codeword  process  is 
specified  by  conditioning  on  the  encoder  state.  We  also  know  that  the  current 
I. I.  vector  is  specified  by  the  current  state  (i.e.,  it  is  statistically 
independent  of  the  previous  codewords  if  the  state  is  known).  However,  we  have 
not  addressed  the  problem  of  finding  the  conditional  distribution  Fj(iJ  sk) . In 
this  section,  we  shall  narrow  our  analysis  to  CE  codes,  and  propose  some 
simplifying  assumptions  about  the  symbol  sequence  statistics.  These  assumptions 
will  allow  us  to  derive  conditional  distributions  which  are  closely  related  to 
Fjdklsk)  a straightforward  manner. 

For  the  remainder  of  this  analysis  we  shall  consider  only  CE  codes.  A 
useful  quantity  in  the  study  of  CE  codes  is  the  Digital  Sum  (DS).  For  the 
channel  (symbol)  sequence,  . . ,a_1 ,aQ ,a1 , . . . , the  DS  is  defined  at  each  time  k as 

. 

k-1 

ak*  l a j , (2.14)  ^ 


j 
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where  the  sum  is  real.  Recall  that  in  the  definition  of  a CE  code,  (2.10),  the 
change  in  the  encoder  state  was 

a(sk+1)  - a(sk)  = l a(i),  (2.15) 

i*1 

where  is  a function  from  the  state  set  to  the  integers.  From  (2.14)  and 
(2.15)  we  see  that  the  image  under  a of  the  state  at  word  time  k differs  from 
the  DS  at  that  time  by,  at  most,  an  additive  constant;  that  is 


(2.16) 


with  <?( 1 ^ denoting  the  DS  associated  with  the  1st  digit  of  the  (k+1)st  codeword 
k+1 

in  the  sequence.  Clearly,  the  DS  process  for  a CE  code  is  closely  related  uo 
the  state  process  of  the  encoder.  Letting  d be  the  number  of  DS  values  that  may 
occur  and  recalling  that  r is  the  number  of  encoder  states,  it  is  obvious  that 
d * r,  since  does  not  necessarily  equal  a(j)  for  any  j = 1,...,r  when  i>2. 
We  may  transform  any  CE  codebook  into  an  equivalent  DS  table.  As  an  example,  we 
show  the  DS  table  for  PST  in  Table  2 (see  codebook  in  Table  1). 


Table  2. 

DS  table 

for  PST 

u 

bu 

state  1 

state  2 

1 

00 

01 

12 

2 

01 

01 

10 

3 

10 

01 

10 

4 

11 

00 

1 1 

In  this  case  we  have  chosen  C =1  (recall  that  o(i)=i,  i=1,2).  Note  that  in  the 


a(i)-1  for  all  u . We  also  find  that  even  though  there 


digit  slot  a 


are  two  states  in  the  encoder,  the  DS  may  take  on  four  different  values  (viz 


In  order  to  determine  the  statistics  of  the  I. I.  for  a CE  code,  it  will  be 


useful  to  introduce  some  ideas  about  the  statistics  of  the  DS  process.  Since 


the  channel  sequence  may  be  placed  in  a 1-1  correspondence  with  the  DS  process 


the  DS  process  is  also  cyclostationary  with  period  n.  At  this  point,  we 


introduce  a simplifying  assumption.  Let  a 


denote  the  DS  value 


corresonding  to  the  ks  digit  of  the  codeword  which  is  the  image  of  the  input 


word  b , when  the  encoder  is  in  state  i.  For  example,  in  Table  2 we  find  that 


1 for  the  PST  code.  We  shall  define  the  DS  symbol  process 


with  the  probability  mass  function  (pmf) 


denotes  the  qfc  component  of  the  DS  vector.  This  pmf  may  be  derived 


where  o 


from  the  DS  table  by  averaging  over  the  states  and  inputs  as  the  following 


where 


0;  otherwise 
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is  a counting  function.  The  probabilities,  and  6u  , have  been  defined  in 
section  2.4.  Our  definition  of  the  stationary  symbol  process  may  be  arrived  at 
by  considering  the  phase  of  the  actual  symbol  process  as  a random  variable  which 
is  independent  of  the  symbol  process  with  the  probability  of  any  phase  being 
1/n. 

I 

The  transition  probabilities,  P[  0 t=J lct_i=k] , will  be  defined  in  terms  of 
the  Joint  process,  { ot,at}.  Since 


CTt=  at-1+at-1’ 


(2.20) 


it  is  clear  that 


Pt(  ot=j),(  ot_i=k)]=  P[(at_i=j-k) ,(  a t_i =k) ] . (2.21) 

We  may  combine  the  codebook  and  DS  table  for  any  CE  code  into  a Joint  code 
symbol/DS  table  by  inserting  the  pair  ( a a^j)  into  the  appropriate 
position  of  the  codebook.  For  the  PST  code  (Table  1)  with  DS  table  in  Table  2, 
we  obtain  the  joint  DS/symbol  table  as  shown  below  in  Table  3. 


Table  3.  Joint  symbol/DS  table  for  PST 


u 

bu 

state  1 

state  1 

1 

00 

(0,+) ,(1 ,-) 

( 1 ,+) , (2 ,-) 

2 

01 

(0, -),(!,♦) 

U,-),(0,+) 

3 

10 

O 

o 

o 

o 

1 

4 

11 

o 

o 

w 

o 

v 1 ,0 ) , ( 1 ,-) 
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For  the  PST  code  (moreover,  for  any  ternary  code),  it  is  clear  that 
P[  °t=j  1 0 1_ -j  =i ] is  non-zero  for  je  {i+l,i,i-1}  only. 

The  joint  probabilities  in  (2.21)  may  be  calculated  from  (2.18)  by 
redefining  the  counting  function  as 


(q) 

iu 


(fc,  j) 


f’! 


and  ci^sj-k 
iu  iu 


^0;  otherwise 


(2.22) 


The  DS  transition  probability  is  then  easily  calculated  from  the  joint 
probabilities  (2.21)  and  the  pmf  of  the  DS  (2.18).  We  define  the  matrix  of  DS 
transition  probabilities  as  Q=  II  Qjj  II  '^here  qj_j=Ptat  =j|ot_=i]. 

We  now  propose  a simplifying  assumption  concerning  the  I. I.  symbol 
process.  The  motivation  for  this  assumption  may  be  found  by  recalling  that  the 
vector  I. I.  probability  distribution  may  be  specified  in  terms  of  the  encoder 
state.  For  a CE  code,  (2.16)  indicates  that  this  is  equivalent  to  saying 


F (i(^}  | st=j)=FI(i(^)  | o(1)»  a(j)).  (2.23) 

In  words,  (2.23)  means  that  the  probability  distribution  for  the  first  component 
of  the  I. I.  vector  is  specified  by  the  DS  that  corresponds  to  that  symbol  time. 
Of  course,  this  is  not  true  for  the  other  components  of  it>  Because  the  DS  will 
always  be  finite  for  a CE  code  with  a finite  number  of  states,  it  may  be 
utilized  as  a conditional  event  for  the  I. I.  symbol  process  in  the  same  manner 
as  the  encoder  states  may  be  used  for  the  vector  process.  The  cyclostationarity 
of  the  I. I.  symbol  process  requires  the  computation  of  the  I. I.  probability 
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distributions  fbr  each  of  the  n phases.  However,  we  may  define  a stationary 
randan  process  based  on  the  cyclostationary  I. I.  process  by  averaging  the 
conditional  probabilities  (conditioned  on  the  DS  and  phase)  over  the  phases 
(assuming  each  phase  is  equiprobable) . We  expect  that  the  statistics  of  this 
stationary  process  will  be  similar  to  those  of  each  phase  of  the  I. I. 
cyclostationary  process  (i.e.,  we  are  assuming  that  each  phase  of  the  I. I. 
conditioned  on  the  DS  have  approximately  the  same  statistics).  Hence,  for  the 
remainder  of  this  analysis,  we  will  assume  that  the  probability  distribution  of 
the  I. I.  is  independent  of  the  codesymbol  process  when  conditioned  on  the  DS. 
In  Chapter  4,  we  will  present  analysis  and  results  which  yield  evidence  for  the 
validity  of  this  assumption. 

Based  on  these  assumptions  we  may  develop  the  statistics  of  the  I. I. 
process.  Following  the  development  in  section  2.5,  we  let  A denote  an  infinite 
sequence  of  channel  symbols,  a^.a^  , . . . , and  A the  set  of  all  such  sequences. 
We  can  partition  the  set  A into  d subsets  A^  where 

k-1 

Ai=  { A|  i=  l aj}  = { A | ok=i}.  (2.24) 

j=-°° 

Thus,  the  probability  distribution  function  of  the  I. I.  based  on  the  stationary 
DS/channel  symbol  process  may  be  expressed  as 

d 

Fj(a  )=  £ F j ( a | o(c=i)P(  a^si).  (2.25) 

i=1 

Expression  (2.25)  indicates  that  for  CE  codes,  the  I. I.  distribution  may  be 
found  by  conditioning  on  a finite  number  of  events.  The  topic  of  the  next 
chapter  will  be  the  calculation  of  the  conditional  distributions  Fj(  a ! 0 ). 
One  must  keep  in  mind  that  when  we  discuss  these  distributions,  we  are  referring 


to  the  I. I.  process  which  results  from  the  assumptions  made  in  this  sectjon. 


2.7  System  Model  Revisited 

In  section  2.1,  we  began  with  an  overall  model  of  a repeatered  line.  The 
source  terminal,  and  repeater  models  were  discussed.  The  remainder  of  this 
chapter  was  concerned  with  the  preliminaries  to  evaluating  the  error  performance 
of  the  first  link  in  Fig.  1.  We  have  seen  that  the  statistics  of  the  line 
encoder  are  of  paramount  importance  in  determining  the  statistics  of  the  I. I. 

The  remaining  links  in  the  repeater  chain  are  identically  modeled.  However 
when  evaluating  the  I. I.  statistics  for  the  remaining  links  it  should  be  noted 
that  we  must  assume  that  no  errors  have  occurred  in  the  previous  links.  If 
errors  have  occurred,  the  statistics  of  the  line  sequence  in  the  succeeding 
links  will  be  different  and  the  results  of  the  analysis  invalidated.  The  error 
rate  in  these  systems  is  typically  very  low,  hence,  this  assumption  should  not 
cause  much  inaccuracy  in  the  results. 

The  destination  terminal  in  Fig.  1 has  not  been  discussed.  Its  digital 
model  is  shown  in  Fig.  8. 


Figure  d.  Model  for  destination  terminal 


It  is  seen  to  be  identical  to  the  repeater  model  with  the  exception  of  a line 


decoder  which  converts  the  estimate,  ak,  to  the  binary  estimate,  bt,  of  the 
original  information  sequence  {btJ.  When  bt$bt,  a bit  error  is  said  to  occur. 
The  relationship  between  the  symbol  error  probability  and  the  bit  error 
probability  will  be  discussed  later  along  with  a discussion  of  the  line  decoder 
structure  for  F31  codes.  A final  note  on  the  destination  terminal  concerns  the 
decision  device.  Although  the  repeaters  require  simple  detectors  for  economic 
and  space  reasons,  the  terminal  decision  device  may  be  somewhat  more  complex. 
In  addition,  in  order  for  the  decoder  to  operate  properly  on  block  codes,  the 
symbol  stream  must  be  frame-synchronized.  We  will  discuss  these  aspects  of  the 
decoding  process  in  a later  chapter. 
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CHAPTER  3 

FIRST  ORDER  MODEL  OF  INTERSYMBOL  INTERFERENCE 


3.1  Preliminary  Remarks 


Fran  the  discussion  in  section  2.3,  we  concluded  that  numerical  methods  are 
necessary  to  find  the  P(E)  caused  by  Gaussian  noise  and  1. 1.  A convenient 
method  developed  independently  by  Ho/Yeh  [13]  and  Shimbo/Celebiler  [14]  utilizes 
an  infinite  series  representation  of  the  Gaussian  density,  known  as  the 
Gram-Charlier  series  (see,  e.g.,  [15]).  The  result  is  an  approximation  to  P(E) 
by  truncating  the  series.  The  approximation  may  be  made  arbitrarily  accurate  by 
including  more  terms  in  the  calculation.  This  method  is  convenient  because  only 
the  moments  of  the  1. 1.  are  needed  in  the  series,  and  these  moments  may  be 
found  recursively.  In  the  above  references  the  channel  sequence  was  assumed  tc 
be  a sequence  of  independent  and  identically  distributed  binary  random 
variables.  Ho  and  Yeh  [16]  extended  this  to  include  multilevel  sequences.  In 
addition,  the  authors  have  assumed  that  the  error  channel  has  a finite  impulse 
response.  In  [17],  Cariolaro  and  Pupolin  extended  the  recursive  method  of 
calculating  the  1. 1.  moments  to  include  channel  sequences  generated  by  an  FSM 
encoder;  they  also  assumed,  however,  that  the  error  channel  has  a finite 
response.  In  this  chapter,  we  shall  develop  a method  of  recursively  calculating 
the  1. 1.  moments  for  a first  order  channel  with  an  exponential  response 
(recursive  channel)  for  CE  codes.  We  shall  take  advantage  of  the  conditioning 
on  the  DS  as  developed  in  section  2.6. 

The  error  probability  will  be  calculated  for  a fixed  threshold  detector. 
When  L is  odd,  the  threshold  detector  may  be  described  by  the  functional 
relationship 


t 
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L-2;  rk>(2L-5)/2 

i;  (2i-1  )/2«  rk<  (2i+1  )/2,  k=0 ,±1 , . . . ,±(L-3 ) 

-(L-2);  rk<-(2L-5)/2  . 


(3.D 


We  may  define  a aimilar  relationship  when  L is  even,  but,  for  the  remainder  of 
this  thesis  we  shall  focus  on  the  ternary  case,  (L=3).  For  the  ternary  case 
(3.1)  beoanes 


1;  rk>  1/2 

*k-  i 0;  -1/2<  rk<  1/2 
-1;  rk<  -1/2  . 


(3.2) 


It  is  clear  from  (3.2)  that  the  threshold  detector  will  make  an  error  whenever 


i)  rk  < 1/2,  and  ak=1 
ii)  rk»1/2  or  rk«  -1/2,  and  ak=0 
iii)  rk>-1/2,  and  ak=-1  , 


(3.3) 


Recalling  that  rk=ak+ik+Ck  , we  may  express  the  error  probability  as 


p(ik+(  k'i/2);  v1 

P(E)=  ^P(  ik+c  k>  1/2)+P(ik+£  k<  -1/2);  ak=0 
P(ik+  5k>  1/2);  ak=-1  . 


(3.4) 


At  this  point,  it  is  important  to  note  that  since  the  channel  symbol  sequence 
and  the  I. I.  sequence  are  cyclostationary,  the  sequence  of  error  probabilities 


"V-  .sir' 


i I 


lJ 


components  of 

P(E)=  ( 1/n)  I e(J}.  (3.6) 

j=1 

It  is  clear  tnat  the  computation  of  (3.6)  requires  finding  the  statistics  of  the 
vector  I. I.  process.  However  for  CE  block  codes,  we  have  assumed  that  we  can 
determine  the  statistics  of  the  I. I.  by  conditioning  on  the  DS.  This 
simplifies  the  analysis  greatly  but  it  is  important  to  remember  that  the  error 


probabilities  that  we  derive  will  not  be  identical  to  (3.6),  except  when  the 

A- 

blocklength  n is  equal  to  one. 

I 

We  shall  find  P(E)  via  (3.4)  with  the  symbol  stationarity  assumption. 
Letting  Fi+N|a^e^a  ^=P^ik+^k<G  I ak=a  ^ where  « e {-1,0,1}  be  the  probability 
distribution  of  the  noise  plus  I. I.  conditioned  on  the  symbol,  a^r  the 
expression  for  P(E)  may  be  written  as 


P<E>=  FI+N|a(-1/2l1)P(ak=1)  + pI+N|a^ 1/2I )p(ak=-1 ) + 
«FI+N|a(-1/2!°)+7I+N|a(1/2l0^p'ak=0) 


(3-7) 
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where  F j+fj|  a(  e |a  ) = 1-Fj+jj|  a(  e |a  ).  Because  the  noise  is  independent  of  the 
symbol  sequence  and  the  I. I.,  we  need  be  concerned  with  the  I. I.  distributions, 
Fi|a(e|a  )•  This  distribution  may  be  expressed  as 


d 

FI  |a(  el<*  )=  l FI,a  | a(e  »Jl  a )p(a=a|  c=j)  = 

j=1  (3.8) 

d 

^ FI,al  a^  e >j|a 
j=  1 


where  F^a|  a(‘,‘  | *)  is  the  joint  distribution  of  the  1. 1.  and  the  DS 
conditioned  on  the  channel  symbol.  However  we  have  assumed  that  the  I. I. 
distribution  is  completely  specified  by  the  DS  at  anytime  so  that  Fj  a|a=Fj  a . 
With  this  simplification,  we  may  condition  the  error  probability  on  the  DS, 
whereby  we  obtain 


P(E  | a=j)=FI+wja  (-1/2 ] j+Qj,  j+i  J+ 

(3.9) 

fI+N|ct  (1/2I j)Cqj,j+qj,j-1]- 

The  error  probability  expression  (3.7)  may  then  be  expressed  in  terms  of  (3.9) 
as  follows 

d 

P(E)=  l P (E ] j)P(o=j)  (3.10) 

j-1 

Before  evaluating  (3.10),  it  is  necessary  to  derive  the  conditional 
distributions,  Fj|a  ( ^|j)-  In  the  next  section,  we  shall  do  this  for  a 


) 


> i 


jj 


I 


first  order  channel. 


3.2  L.I.,  X2n  ilia  First  Qrdg£  Channel 


In  this  section  we  shall  analyze  tne  statistics  of  intersymfcol  interference 
when  CE  codes  are  transmitted  through  a channel  containing  at  most  one  pole  in 
its  transfer  function.  We  assume  that  the  channel's  impulse  response  is 
described  as 


0;  j<0 
{ 1 ; j-0 

c X j"1;  j>  1 


(3.1D 


where  0 < X < 1 . This  description  of  the  channel  is  useful  oecause  it  allows  us  to 
separate  the  desired  signal  component  from  the  I. I.  easily.  This  can  be  seen 
in  tne  block  diagram  model  of  the  system  shown  in  Fig.  9. 


Figure  9.  First  order  channel. 
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The  figure  clearly  shows  that  the  received  signal  is  the  sum  of  the  desired 
symbol  and  the  I. I.  which  is  generated  by  the  error  channel  (enclosed  in  the 
dotted  box) . 

Furthermore,  by  varying  the  value  of  c we  can  approximate  different  channel 
characteristics.  In  particular,  when  c is  negative,  the  response  of  the  system 
approximates  that  of  a channel  with  a low  frequency  cutoff.  Fig.  10  illustrates 
this  situation. 


4 


Figure  10.  Approximation  of  a low  frequency  cutoff. 


n 


Our  analysis  begins  by  considering  the  conditional  distribution  of  the  I. I. 
on  the  DS,  Fj|  G (e|k).  For  the  derivation  to  follow,  it  will  be 
conceptually  simpler  to  explicily  note  the  time  dependence  on  the  distributions. 
We  shall  let 
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at?  a-~— 


Ft(  e|  k)=P(it<e|  a t=ic) , 


(3.12) 


. . 


and  consider  the  conditional  characteristic  function  of  it,  given  a t,  which  is 


Mt(ja)  ,k)=  / eJ^tdFj.Uj  k). 


(3.13) 


From  our  definition  of  the  first  order  channel,  we  may  express  it  recursively  as 


it*  X it-1+cat-1=  X it-1+c(  fft-°t-1)( 


(3. 1*0 


The  conditional  distribution  may  also  be  expressed  recursively.  Considering  the 
event,  {it<e  | at=k},  it  is  clear  that  this  event  is  equivalent  to 


U {it_1  < = ot=k}. 


(3.15) 


«.=  1 


u 

o 

0 

(1 


0 

0 


Our  assumption  that  the  DS  completely  specifies  the  distribution  of  the 
I. I.  at  any  time  allows  us  to  write  the  distribution  function  as 


d 

■ r e - c(k-Jt)  . 

Ft(e|k)  = I F ( I l)  p 

A 


(3.16) 


where  P j^Pt  o t_i -l  \ a t=k] , are  the  reverse  one-step  transition  probabilities 
of  the  stationary  DS  process.  The  reverse  transition  probabilities  of  the  DS 
process  are  found  in  terms  of  the  forward  transition  probabilities  as 


p k =qkf 


P (o=i) 


(3.17) 


P(o=k) 


■ 
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We  shall  let  R=  ||p^  j||  denote  the  matrix  of  reverse  transition  probabilities. 

Letting  n =[ e -c(k-l) J/X  , we  may  express  the  conditional  characteristic 
function  in  (3.13)  as 


Mt(>  ,k)  = l Pkl/  e-*“[Xn  ♦e(k-1)]dFt_l(n|i)s 
1=1 

l PkleJ  Wc(k'1)Mt_1(jXo)  ,1). 

1=1 


(3.18) 


We  may  disregard  the  explicit  time  dependence  because  we  are  interested  in  the 
steady  state  whereby  the  functional  equation  for  the  conditional  characteristic 
function  is, 


M(ju  ,k)=  I pkleJa>  c(k_1)M( JXm  ,1). 


(3.19) 


1=1 


We  have  mentioned  previously  that  we  will  utilize  the  moments  of  the  I. I. 
in  tfle  error  probability  calculation.  From  (3.19),  we  may  obtain  the 
conditional  moments  of  tne  I. I.  and  DS  via  the  well  known  relationship  between 
the  moments  and  the  characteristic  function  for  a random  variable.  Thus,  we 
have  (see,  e.g.,  [l8]) 


1=1 


jpE[ip|k]=  I Pkl  JL  {e^  o(k_1)M(j  Xu  ,1)} 

dio 


(3-20) 


w=0 


<1 


The  derivatives  of  the  right-hand  side  of  (3-20)  may  be  conveniently  expressed 
as 


[ jc(k-l)  + 

dm 


]pM(jXm  ,1) 


m=0 


(3.21) 


where  the  quantity  in  parentheses  is  understood  to  be  an  operator.  Furthermore, 


M(jXu)  ,1)  =jP\PuP 

dojP  id=0  1 


(3.22) 


where  =E[ip|  o =1]  is  the  pth  moment  of  the  conditional  random  variable 

{ i | a=l}.  Thus  for  any  moment  of  the  I. I.,  we  obtain  a system  of  linear 
equations  as  follows: 


d P / \ 

u P=  £ P kl  W^jtcOc-l)]1  xP-Sp;  k=l,...,d. 


1=1  i=0 


(3.23) 


We  can  cast  (3.23)  into  matrix  form  by  rearranging  terms: 


( x ~P“  p kk)_  l p kl  11 1 = 
l#k 

l Pkl  l(f)[c( k-l)]1  X-iyP-i. 

1=1  i=1 


(3.24) 


Expression  (3.24)  is  a system  of  linear  equations  in  the  pth  conditional 

k 

moments.  This  system  is  in  terms  of  ;k=0 , . . . ,p-1 , hence,  these  moments 

may  be  calculated  recursively.  In  matrix  form  we  have 


[R(  X-P]TyP=fcp. 


(3.25) 


The  matrix  R(X  j is  a tridiagonal  matrix  in  the  ternary  case.  The  structure  of 


R(  X“P)  is  explicitly  shown  oelow. 


It  should  be  noted  chat  (3*25)  has  a unique  solution  for  all  DS  transition 
matrices.  If  R(  A _p)  were  singular,  then  A-p  would  be  an  eigenvalue  of  R. 
Since  R is  a stochastic  matrix,  however,  all  of  its  eigenvalues  have  modulus 
less  than  1 [32].  Because  |a|  < 1,  A pcannot  be  an  eigenvalue  of  R.  Thus,  in 
order  to  find  the  conditional  I. I.  moments  up  to  order  p,  the  vector  is 

calculated  from  the  right-hand  side  of  (3.24)  and  the  system  of  equations  in 
(3.25)  solved. 

It  is  interesting  to  specialize  (3-25)  in  order  to  obtain  the  mean  values 
(p=1).  In  this  case,  we  have 


*1  = 


(3.27) 


By  solving  the  equation 


0 

a 
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[R(  X_1)]T  y_ 1 = ( 1 /c )i-|  (3.28) 

where  M =(1/c)  U.  , we  see  that  the  mean  values  are  directly  proportional  to 
c . 

Finally,  in  regard  to  the  conditional  moments,  we  should  note  the 
calculation  of  the  central  conditional  moments,  which  we  will  utilize  in  the 
next  chapter  when  evaluating  the  error  probability  for  a simple  decision 
feedback  detector  based  on  the  DS  process.  The  adjustment  is  straightforward. 
Letting 


00  1 ^ 

M(jw,k  )=/  eJ“  ( Z "ii-k^FUl  k)  = e"j(UMk  M(jw  ,k) 


(3.29) 


be  the  conditional  I.I./DS  characteristic  function  when  the  I. I.  is  translated 


by  its  mean  value,  we  have 

d 

M(jw  ,k)=  l P^e^  te(k-U-Pk  + Xyl  ]M(jXw  ,1). 
1=1 


(3.30) 


Now  the  expression  tor  the  central  moments  becomes 


pP  = l P [c(k-l)-  uk  + X ux  ]+  X u 
k 1=1 


(3-31) 


Thus,  to  find  the  central  moments,  we  substitute 


c(k-l)-  uk  + X y 


(3.32) 


for  c(k-l)  in  the  right-hand  side  of  (3.2M). 


&£  Error 


In  section  3-1  we  derived  the  expression  for  the  error  probability  in  terms 
of  the  conditional  probabilities,  (3.9),  as 


P(E)=  l P(E  I J)P(o 
j=1 


(3-33) 


In  the  preceding  section  we  derived  the  joint  moments  of  the  I. I.  and  DS. 
Because  the  I. I.  and  noise  are  Independent,  the  conditional  distribution  for 
their  sum  is  found  from  the  convolution 


/ / FN|a  ( 5“xIJ)dFI|a  (xl  j)dS 

-»  xelj  1 


(3.3*0 


where  Ij  is  the  range  of  the  conditional  I. I.  random  variable.  The  conditional 
noise  distribution  is  FN ( * ) since  the  noise  is  independent  of  tne  DS.  Recalling 
that  we  have  assumed  the  noise  is  Gaussian,  we  may  express  the  conditional  I. I. 
plus  noise  distribution  as 


FI+N|a  (e  I J>=  / / *U -x)dFj  | 0(x|  j)« 


-<*>  xelj 


where 


(3-35) 


$ (z)=  ~ — e"z  /N0, 
7TRT 


(3.36) 


is  the  Gaussian  density  function  with  variance  equal  to  Nq/2. 
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If  we  expand  (3-35)  into  a Taylor  series  about  the  point  £ , we  obtain 


Fi*»|o  ( e lJ>=  / / j ±2l_  dTj.^xij)  « 

— x£Ij  $,=0  £! 


(3.37) 


where  f^(  5 ) denotes  the  4 th  derivative  of  ®(')  evaluated  at  the  point  £ 
Expression  (3-37)  can  be  rearranged  to  produce 


J *(5  )d5  + l 


(-1)  e 

/4>(<l)(C) d£  / x^dFjl  a(x|j)  = 

£=1  *!  -a.  xel. 

j 


(3.38) 


(~f ) 


/ *u  ><u  + y 

it  1 41 


$ 


(4-1) 


(e  ) u • 
j 


where  U are  the  conditional  I. I.  moments.  Before  we  calculate  the  total 

j 

error  probability,  we  must  mention  the  evaluation  of  the  derivatives  of  $(z). 
Fortunately,  these  may  be  evaluated  recursively  since  (see,  e.g.,  [15]) 


*(£)(«  )=(-(2/N0]J*/2  Hl  [ 2f/N0] 


e'c2/N0 


(3.39) 


where  H (*)  is  the  Hermite  polynomial  of  degree  4.  The  Hermite  polynomials  are 
recursively  related  as 


H j(z)  = zH£_i (z)  - (4-1 )H£_2(z) 


(3. W) 


with  the  initial  conditions  Hq(z)  = 1 and  H-j(z)  = z.  We  may  substitute  (3.38)  into 
(3.9)  using  (3-39)  to  find  the  conditional  error  probability.  After  some 
algebraic  manipulation,  we  arrive  at 
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P(E 


l-»  -<«!.!* ♦*,.!»( 

+ E -L—  [2/Nj- 
£-1  £l  V*tt 


,£/2  "1/4N0  l 
1 e Uj 


(3.41) 


where 


Q(x)  = J e"?  11  d§  • 

V2tt  x 

Further  simplification  of  (3*41)  is  possible  by  noting  that 


(3.42) 


qj,j+1  + qjJ-1  + 2qj,J=  UqJ,j 


(3.43) 


In  addition,  we  note  that  the  Hermite  polynomials  have  the  following  property 


H£(x)=  (-1)  H£(-x). 


(3.44) 


In  words,  the  Hermite  polynomials  of  even  order  are  even  functions  of  their 
arguments,  and  those  of  odd  order  are  odd  functions.  These  considerations  allow 
us  to  rewrite  (3.41)  as 





P(E|a-j)  - (1  +q.  ,)<*'  7=1+  " 7=  (2/N  Z/2H  ([2N  ]^) 

£«1  i:  u * 


^f(qj#J  + qJ.j-l>  + (-1)i(qj,j  +qJ.J+l)]-  (3'q5) 


The  total  error  probability  is  found  to  be 


P(E)  * (1  + Pft)  Q 


V2N0 


* 1 "1,'4Nn  _fc 

+ E = e H i([2N0]  *) 

x-i  r.  J K 11  0 


(3.46) 


where  Pq  denotes  the  probability  that  the  channel  symbol  is  zero.  The  first 
term  in  (3-46)  is  seen  to  be  the  error  probability  for  the  code  when  the  I.I. 
is  zero,  and  the  infinite  series  is  the  effect  on  the  error  probability  by  the 


In  the  next  section  we  shall  present  some  results  of  this  analysis  for 
several  known  line  codes. 
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3.4  Illustration  of  the  First  Order  Model  &£  I .1 . 

We  have  developed  a method  to  evaluate  the  error  probability  for  a first 
order  model  of  1. 1.  There  are  two  parameters,  c and  X1  , both  of  which  can  be 
varied  to  produce  different  conditions.  The  parameter  c is  the  maximum  of  the 
error  channel's  impulse  response.  It  indicates  the  magnitude  of  the  I. I. 
relative  to  the  magnitude  of  the  signal.  Thus  we  may  adjust  c to  obtain  a 
variety  of  signal-to-I.I.  ratios.  In  addition,  the  sign  of  c is  related  to  the 
type  of  channel  in  that  a high  pass  channel  will  be  modeled  by  a negative  value 
of  c,  and  a lowpass  channel  by  a positive  value.  The  parameter  X indicates  the 
length  of  time  for  which  one  channel  symbol  interferes  with  its  successors.  If 
we  consider  the  number  of  symbols  for  which  the  error  channel  response  is 
greater  than  .1$  of  c as  a criterion  for  the  duration  (denoted  x ) , we  may  find 
the  corresponding  value  for  X as  ICT-^  . 

The  first  set  of  curves  which  we  display  represent  an  error  channel  with  x 
equal  to  100  symbols  (a  slow  response).  The  value  of  X corresponding  to  this 
is  .933.  In  Fig.  11  we  show  P(E)  for  the  PST  code  for  four  values  cf  c (viz. 
-. 1 ,0, .05, . 1 ).  The  zero  value  of  c is  equivalent  to  the  zero  1. 1.  case.  It  is 
seen  that  the  negative  value  of  c does  not  degrade  P(E)  as  much  as  the  eaual  but 
opposite  positive  value.  In  fact,  for  c e [-.05, -.005]  we  found  a slight 
improvement  in  the  error  probability  at  low  signal-to-noise  ratios.  An 
intuitive  explanation  for  this  is  the  similarity  of  tne  the  DS  random  process  to 
a random  walk  with  reflecting  barriers.  As  the  DS  increases  towards  a barrier, 
the  I. I.  tends  to  decrease  (become  more  negative).  The  probability  of  the  DS 
changing  directions,  however  (i.e.,  the  probability  of  a channel  symbol  with 
negative  weight),  increases  so  that  the  I. I.  tends  to  enhance  the  noise  margin 
(i.e.,  it  adds  constructively  to  the  signal).  When  the  SNR  is  low  this 
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constructive  I. I.  lowers  the  error  probability  slightly.  This  also  explains 
why  the  positive  values  of  c increase  the  P(E)  at  low  SNR  since  the  I. I.  tends 
to  be  destructive  (degrades  the  noise  margin)  most  of  the  time.  We  see  from 
Fig.  11  that  when  P(E)=10“^,  the  degradation  in  the  SNR  is  approximately  2 db 
for  the  severe  I. I.  case  (c=.1). 

in  Figs.  12  and  13  we  show  P(E)  for  several  known  codes,  including  our 
running  example.  PST.  The  other  codes  are  the  Bipolar[19],  MS43[20],  an  (8,6,3) 
code,  and  a (2,2,3)  code[21].  In  these  illustrations  X is  again  .933  and  c is 
.1  (in  Fig.  12)  and  -.1  (in  Fig.  13)-  At  this  high  level  of  I. I.  there  is  a 
considerable  difference  in  the  error  performance  of  different  codes.  For 
P(E)  = 10--’  and  c=-.1,  the  difference  between  the  best  and  worst  is  approximately 

2.6  db,  while  for  c=.1,  this  difference  is  approximately  3-3  db. 

« 

Next  we  present  an  identical  set  of  curves  for  X equal  to  .501.  This 
corresponds  to  a fast  channel  response  ( t=10).  In  Fig.  14,  P(E)  is  plotted  for 
the  rST  code  with  values  of  c corresponding  to  Fig.  11.  The  worst  error  occurs 
again  for  c=.1  which  degrades  the  SNR  by  approximately  1.8  db  at  P(E)=10“^.  A 
careful  comparison  of  Fig.  11  with  Fig.  14  reveals  that  the  difference  in  error 
performance  between  X = .501  and  X = .933  is  small  for  PST,  the  maximum  being 
approximately  .5  db  when  c=-.1.  However,  when  we  compare  the  performance  of  the 
different  codes  in  Fig.  15  and  Fig.  1 6 , we  find  that  the  difference  between  the 
best  and  worst  error  performance  is  only  .9  db  when  X =.501.  The  error 
performance  of  MS43  and  (8.6,3)  is  considerably  improved  by  the  fast  response  of 
the  channel. 

In  order  to  illustrate  the  dependence  of  P(E)  on  the  duration  of  the 
channel  response,  we  have  plotted  the  error  probability  of  MS43  with  c= . 1 for 
several  values  of  t ranging  from  5 symbols  to  200  symbols  (Fig.  17).  ubserve 
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that  P(E)  increases  considerably  as  the  duration,  t , increases  from  10  to  50 
symools.  However,  for  t greater  than  50  symools  (corresponding  to  X > .871), 

the  uifference  is  not  great  (approximately  .6  db) . In  the  next  chapter,  we 
shall  investigate  the  behavior  of  the  I. I.  and  P(E)  for  the  case  of  a slow 


channel  response.  II 
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CHAPTER  4 

MODELS  FOR  SLOW  CHANNELS 


4.1  Digital  Sum  Channel 


In  the  previous  chapter,  we  have  observed  that  when  the  first  order  error 
channel  has  an  effective  response  of  50  symbols  or  greater,  the  parameter  X is 
close  to  unity.  With  this  in  mind,  we  shall  consider  a simplification  of  the 
first  order  model  via  its  limiting  behavior  as  X approaches  unity.  Upon 
substituting  X =1  into  the  definition  of  the  first  order  channel  response 
(3.11),  we  obtain 


0;  j<0 


hj  = 


(4.1 


c;  y* i 


At  first  glance,  it  is  easy  to  recognize  that  this  system  is  unstable.  For 
example,  if  a sequence  of  +1's  of  indefinite  length  were  transmitted  through 
this  channel,  the  magnitude  of  the  I.I.  would  increase  indefinitely.  Eut  this 
property  will  not  be  a disadvantage  for  this  analysis.  The  I.I.  at  some  time  k 
is 


k-1 


ik=  E caj=cak. 


(4.2) 


D=-° 


In  words,  the  I.I.  is  directly  proportional  to  the  DS  at  every  time.  Because 
this  analysis  is  limited  to  CE  codes  with  a finite  number  of  states,  the  DS  will 
always  be  finite.  Hence,  even  though  the  error  channel  (4.1)  is  unstable  in  the 
sense  that 
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I |hjl=  “ , (4.3) 

j-0 

it  is  stable  for  the  restricted  class  of  inputs  under  consideration.  For 
convenience,  we  shall  refer  to  this  channel  as  the  DS  channel. 

There  is,  however,  one  disadvantage  in  defining  the  I. I.  for  the  DS 
channel  as  in  (4.2).  The  values  and  statistics  of  the  I. I.  in  this  case  are 
dependent  on  the  choice  of  the  initial  value  of  the  DS  sequence.  Intuitively  we 
expect  that  the  I. I.  for  any  physical  channel  will  be  independent  of  this 
arbitrary  specification.  Indeed,  we  have  found  in  the  previous  chapter  that 
when  X is  less  than  1 , the  solution  for  the  conditional  moments  in  the  first 
order  model  is  unique  regardless  of  this  specification.  Since  we  want  to  make 
the  DS  channel  model  as  realistic  as  possible,  it  is  necessary  to  to  be  more 
careful  in  our  definition  of  the  I. I.  for  this  channel. 

We  may  approach  the  definition  or  the  I. I.  for  the  DS  channel  by 
considering  a property  of  CE  codes.  It  should  be  recalled  that  we  have  assumed 
the  channel  symbol  process  is  stationary  with  the  statistics  of  the  process 
derived  from  the  arithmetic  mean  of  the  statistics  of  each  phase  of  the 
cyclostationary  codeword  process.  Furthermore,  we  know  that  the  DS  at  any  time 
is  finite  for  a CE  code  with  a finite  number  of  states.  Clearly,  the  expected 
value  of  the  DS  is  finite.  From  the  definition  of  the  DS  (2.15),  we  note  that 
the  expected  value  of  the  DS  is 

k-4 

E(  ok)=  l E(aj)<  » . (4.4) 

j =-00 

From  (4.4)  we  conclude  that  E[aj]=0  for  any  CE  code.  Thus  the  I. I.  for  any 
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The  error  probability  for  the  DS  channel  is  not  difficult  to  compute. 
Since  the  DS  sequence  takes  on  d possible  values,  and  the  I. I.  is  directly 
proportional  to  the  DS,  the  I. I.  takes  on  values  in  the  finite  set, 


{ c o( j)  | j=1 , . . . ,d}. 


(4.10) 


The  probability  density  of  the  I. I.  contains  d impulses,  which  may  be  described 


l P( a =j)  5 [i-c  a (j) ] . 
i*=1 


(4.11) 


The  conditional  density  of  the  1. 1.,  therefore,  is  the  impulse  <5  (i-c  a (j)).  By 
convolving  this  impulse  with  the  Gaussian  density  function,  we  obtain  the 
conditional  density  for  the  sum  of  the  noise  and  I. I.  for  the  DS  channel  as 


<e  | e -<= 

o'* 


From  (3.9)  we  find  the  conditional  probability  of  error  as 


(1+  2 c 0'  ( j ) \ 

_ " ) 
/ 2N0  ' 

(l-2colj)  \ 

7 _ • ~ ) • 

/ 2N  / 


(4.12) 


(4.13) 


The  total  error  probability  is  found  by  averaging  over  the  marginal  events. 


The  error  probability  for  the  various  codes  discussed  has  been  computed  for 
the  DS  channel.  The  results  are  illustrated  in  Fig.  18  with  c equal  to  .1.  The 
comparison  of  Fig.  18  with  Fig.  12  ( X =.933,  c=.1)  in  Chapter  3 reveals  that  the 
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difference  in  these  two  sets  of  curves  is  negligible.  For  these  codes  at  least, 
the  DS  channel  is  an  excellent  approximation  to  a channel  with  a slow  response. 
Comparing  the  error  probability  curve  for  the  MS43  code  in  Fig.  18  with  the  set 
of  curves  shown  in  Fig.  17,  indicates  that  the  difference  between  an  effective 
channel  response  of  50  symbols  ( A = .871)  and  the  DS  channel  is  approximately  .5 
db  for  an  error  rate  of  10“^.  Furthermore,  among  the  codes  discussed,  the  error 
probability  for  the  MS43  has  been  found  to  be  the  most  sensitive  to  the  channel 
response  duration. 

At  this  point  it  is  appropriate  to  discuss  a simple  parameter  which  appears 
in  the  literature  as  a criterion  for  the  error  performance  for  codes  with 
bounded  DS  with  respect  to  I. I.  [7,22,23].  This  quantity  is  called  the  Digital 
Sum  Variation  (DSV),  which  is  defined  as  one  less  than  the  number  of  possible  DS 
values  that  an  encoded  sequence  may  assume.  Since  we  have  consistently  denoted 
the  number  of  DS  values  as  d;  the  DSV  in  this  discussion  is  defined  as  d-1. 
The  rule  of  thumb  for  judging  a code's  performance  is:  the  smaller  the  DSV,  the 
better  the  code.  In  [22],  Croisier  provided  the  motivation  for  the  use  of  the 
DSV  as  a performance  index  by  assuming  that  the  I. I.  is  directly  proportional 
to  the  DS.  This  assumption  was  made  after  a brief  heuristic  argument  concerning 
the  effects  of  a first  order  low  pass  filter  on  an  encoded  sequence.  The 
results  in  this  thesis  thus  far  have  hopefully  provided  an  analytical  basis  for 
this  assumption.  In  fact,  the  motivation  for  the  DSV  criterion  is  the 
assumption  that  the  DS  channel,  as  defined  in  this  section,  yields  a good 
approximation  to  the  error  probability  for  channels  with  a slow  response. 
Moreover,  the  DSV  criterion  is  essentially  the  peak  distortion  criterion  for  the 
DS  channel,  and  as  such,  it  can  be  used  in  defining  an  upper  bound  for  the  error 
probability  for  the  DS  channel. 


The  DSV  criterion  can  lead  to  erroneous  conclusions  when  comparing  code 


however.  In  Table  4 


discussed.  Upon  comparing  the  DSV  and  the  error  performance  illustrated  in 


18,  we  find  that  the  criterion  would  lead  to  an  incorrect  evaluation  of  the 


MS43  code  as  superior  to  the  (8,6,3) 


Table  4.  Digital  Sum  Variations 


Although  the  actual  error 


performance  was  not  illustrated,  it  was  noted  that  for  the  B6ZS  code  [25],  the 


extreme  values  of  the  DS  always  occur  so  that  the  I. I.  adds  constructively  to 


the  next  pulse,  thereby  enhancing  the  margin  against  the  noise.  This  conclusion 


was  made  assuming  a low  frequency  cutoff  channel  (which  is  equivalent  to 


assuming  c is  negative  in  the  DS  channel  model).  The  (3,6,3)  code  also  has  this 


property.  Unfortunately,  this  line  of  reasoning  does  not  explain  why  the 


(3,6,3)  code  provides  a better  error  performance  than  the  MS43  code  when  c is 


positive  (which  is  the  case  in  Fig.  18).  In  this  case  the  extreme  values  of  the 


DS  always  occur  so  that  the  I. I.  adds  destructively  to  the  next  pulse;  thus 


reducing  the  margin  against  the  noise.  The  next  chapter  will  investigate  these 
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questions  in  more  detail  when  we  propose  techniques  for  the  synthesis  of  CE 
codes  for  the  DS  channel.  We  shall  find  that  the  DS  probability  distribution, 
and  the  DS  transition  probabilities  (or  equivalently  the  joint  DS/code-symbol 
distribution)  can  have  a significant  effect  on  the  error  performance  of  the 
code.  When  all  of  these  factors  are  considered  (including  the  DSV),  we  obtain  a 
more  accurate  frame  of  reference  for  comparing  code  performance.  Nevertheless, 
at  this  point,  it  is  safe  to  say  that  a shadow  of  doubt  has  been  cast  on  the  DSV 
as  an  accurate  indicator  of  the  error  performance  of  CE  codes. 
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4.2  Decision  Feedback  Equalization  for  the  DS  Channel 


Processing  the  decisions  of  the  threshold  detector  in  a feedback  loop  in 
order  to  cancel  the  I. I.  is  an  old  idea  (see,  e.g.,  [2]).  In  the  past  ten 
years  this  idea,  now  referred  to  as  pecision  Feedback  Equalization  (DFE) , has 
received  considerable  attention.  It  has  been  found  that  significant  improvement 
in  performance  can  be  obtained  for  low  quality  channels  over  the  performance  of 
linear  equalization.  Lucky  provides  a brief  survey  with  an  extensive 
bibliography  on  the  subject  in  [5] . One  problem  with  DFE  is  the  propagation  of 
errors  in  the  detector  decisions,  which  has  led  to  the  constraint  of  allowing 
only  non-recursive  (transversal)  filters  in  the  feedback  loop,  thus  confining 
the  error  propagation  to  a finite  number  of  future  symbols.  In  this  section  we 
propose  a very  simple  DFE  scheme  for  the  DS  channel.  Since  the  I. I.  is 
directly  proportional  to  the  DS,  the  repeater  can  keep  track  of  the  DS  after 
each  decision  by  the  threshold  detector  and  compensate  for  the  I. I.  before  a 
decision  is  made  on  the  next  symbol.  Fig.  19  illustrates  a model  for  this 


scheme . 


Figure  19.  Model  for  digital  sum  feedback 


Assuming  no  errors  in  the  detector  output  up  to  time  k,  we  have 

**k  = rk"c  ak  = ak+  5 k* 

In  this  case  we  see  that,  after  compensation,  the  received  signal  is  free  of 
I. I.  for  the  DS  channel.  Thus  the  error  probability  for  this  system  is 
identical  to  the  error  probability  for  a system  with  no  I. I. 

As  with  any  decision  feedback  scheme,  the  possibility  of  error  propagation 
does  exist.  In  fact,  because  the  filter  in  the  feedback  loop  is  recursive  (and 
unstable  for  unrestricted  inputs),  the  error  propagation  could  have  infinite 
duration,  and  the  magnitude  of  the  error  could  become  unbounded.  We  shall 
discuss  how  to  avoid  these  problems  later  in  this  section,  but  we  will  first 
briefly  mention  the  effects  of  a single  error  on  the  performance  of  this  scheme. 

If  at  some  time  k,  the  threshold  detector's  decision  is  in  error  (i.e., 

A A 

a^a^),  the  computed  DS,  will  also  be  in  error.  In  this  case, 
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rk=ak+  ^ k+c^  ^ k"  °k^ * 


(4.15) 


The  most  common  error  is  ak=ak * 1,  that  is,  a channel  symbol  is  detected  as  its 
adjacent  symbol  on  the  real  line.  Wg  shall  refer  to  such  an  error  as  a single 
error.  Note  that  this  is  not  the  standard  definition  of  a single  error  which 
appears  in  the  coding  literature.  For  a single  error,  (4.15)  becomes 


rk=  ak±c. 


(4.16) 


If  is  close  to  its  maximum  or  minimum,  we  see  that  even  with  a single  error, 
the  compensated  signal  will  be  an  improvement  since  most  of  the  I. I.  will  be 
removed.  If  the  DS  is  near  zero,  however,  the  compensated  signal  will  be 
degraded,  or  at  least  not  improved. 

The  fact  that  the  DS  for  CE  codes  is  a finite  random  variable  allows  for 
the  possibility  of  error  monitoring  in  the  repeater.  This  monitor  simply  checks 
o k at  every  k to  see  that  it  is  a valid  DS  value.  If  a k is  not  an  allowable 
value,  it  increments  a counter  which  keeps  a running  average  of  such  DS 
violations.  If  this  running  average  exceeds  a predetermined  threshold,  the 
repeater  may  send  an  alarm  signal  to  one  of  the  terminals.  A faulty  repeater 


may  be  easily  isolated  in  this  manner.  In  the 


(DSF)  scheme 


in  Fig.  19  this  monitor  may  be  utilized  to  correct  o k by  the  following  rule: 


ak+1  ’ °k=CTmin"1 


a k~ 1 ' °k=  amax+1 


(4.17) 


When  errors  occur  infrequently,  single  errors  will  be  compensated  for  in  the  DSF 
device  if  the  encoder  reaches  the  appropriate  extreme  DS  state  before  another 
error  occurs.  Even  if  a long  burst  of  errors  were  to  occur,  the  error  in  the 
estimated  DS  is  overbounded  by  amax-  CTmj.n. 

For  DSF  equalization  it  is  obvious  that  the  Digital  Sum  Variation  is 
irrelevant  to  the  error  probability  when  we  consider  the  error  free  analysis. 
However,  when  we  consider  the  equalizer  with  the  DS  correction  device,  the  DSV 
has  some  redeeming  value.  When  errors  occur  infrequently,  one  measure  of  the 
effectiveness  of  the  correction  device  in  the  feedback  loop  is  the  average 
number  of  symbols  until  the  appropriate  terminal  DS  state  is  reached.  The  worst 
case  occurs  when  is  an  extreme  value  and  the  symbol  error  at  time  k yields  a 
DS  estimate  of  its  adjacent  value.  The  output  of  the  equalizer  will  not  be 
corrected  until  the  other  extreme  value  is  reached  by  the  encoder 
Consequently,  the  worst  case  average  time  until  correction  will  depend  on  the 
DSV  for  the  code  in  question. 


4.3  Performance  sf  E££  £qh  fcJie.  First  Order  1. 1.  Model 

If  we  utilize  the  DSF  scheme  on  channels  which  we  model  as  first  order  with 
A<1  , it  is  clear  that  the  1. 1.  is  not  entirely  eliminated  at  each  time  since 
the  variance  of  the  conditional  I. I.  random  variable  is  not  zero  (the  density 
is  not  impulsive).  Nevertheless,  we  would  expect  an  improvement  as  a 
consequence  of  the  removal  of  a strong  bias  in  the  I. I.  (at  least  when  the 
effective  channel  response  is  50  symbols  or  more). 
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The  error  probability  for  such  channels  with  DSF  equalization  is  easily 
computed  using  the  techniques  discussed  in  the  previous  chapter.  The  received 
signal  after  compensation  is 

rk=ak+ik-  k+  ^ k*  (4.18) 

where  U ^ is  the  conditional  mean  value  of  the  I. I.  for  a We  may  consider 
the  I. I.  random  variable  to  be 

*k  = ^k“ u k (4.19) 

which  is  the  translation  of  the  conditional  I. I.  random  variable  by  its  mean. 
Thus  the  error  probability  may  be  computed  from  expression  (3.46)  by 
substituting  the  central  conditional  moments  for  the  moments. 

The  improvement  in  the  error  probabilty  which  results  from  the  DSF  scheme 
is  most  dramatically  illustrated  in  the  case  of  the  MS43  code.  Fig.  20  shows 
P(E)  for  various  values  of  the  channel  response  duration,  x . The  sensitivity 
to  the  duration  has  been  reduced  significantly.  At  an  error  rate  of  10“^,  the 
greatest  degradation  in  the  signal-to-noise  ratio  relative  to  the  zero  I. I. 
case  is  approximately  1.3  db.  In  Chapter  3 we  found  a difference  of  4.3  db, 
so  that  the  DSF  scheme  has  improved  the  SNR  by  3 db.  In  addition,  it  is  clear 
that  for  the  DSF  scheme  the  worst  performance  occurs  for  fast  channel  responses. 
This  is  expected  because  the  DS  model  represents  the  limiting  behavior  of  the 
model  as  the  duration  approaches  infinity.  We  note,  however,  that  a slight 
improvement  is  achieved  even  for  the  fast  responses.  This  indicates  that  the 
DSF  scheme  may  be  useful  in  eliminating  I. I.  for  the  slow  part  of  a channel 
response  without  increasing  the  deleterious  effects  the  I. I.  for  the  fast  part. 
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mode  is  "small,"  there  are  segments  of  the  real  line  for  which  the  probability 
of  the  I. I.  being  contained  in  these  segments  is  essentially  zero.  Because  the 
Gaussian  function  cannot  approximate  this  behavior,  the  error  probability 
obtained  from  it  is  overly  conservative. 

In  this  section,  we  shall  propose  the  utilization  of  the  Gaussian 
approximation  for  the  conditional  I. I.  densities  (i.e.,  we  assume  that  the  I. I. 
is  Gaussian  when  conditioned  on  the  DS).  When  X is  approximately  unity  we 
would  expect  this  to  yield  a good  approximation  to  the  error  probability  since 
the  conditional  variances  are  small.  In  order  to  see  this,  consider  the  set  of 
linear  equations  for  the  conditional  variances: 


2 , ,_2  . 2 

" pk, k-1°  k-1+(  x ' pk,k)_  pk,k+1°k+r 

~2  ^pk,k-l(c‘uk+Xp,k-l)  + pk,k("M'k+Xy'k)  +pk, k+l( "c"M,k+Xp,k+l)  ^ 


(4.20) 


where  a2  denotes  the  conditional  variances,  and  yk  the  conditional  means. 
Recall  that  the  conditional  means  are  directly  proportional  to  c and  that  in  the 
limit  as  X -*■  1 , the  mean  values  are  equally  spaced  with  the  separation  being  c. 
These  considerations  allow  the  rearrangement  the  right-hand  side  of  (4.20), 
after  which  we  obtain 


~ (1“  X )2C  P k , k- 1 yk-1+  pk,k  y k+  pk,k+1y2k+1]  • 


(4.21) 


Recalling  that  ( 1-  X ) is  a factor  of  the  characteristic  polynomial  of  R(X  P ), 
expression  (4.21)  allows  us  to  state  that  the  conditional  variances  are  directly 
proportional  to  (1-  X )p,  where  p is  at  least  1. 
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Another  result  which  is  not  illustrated  graphically  is  that  the  difference  in 
P ( E)  for  negative  or  positive  c is  negligible  in  all  cases.  Finally,  it  was 
also  found  that  the  difference  in  error  performance  of  the  various  codes  was 
reduced  significantly  with  the  DSF  scheme.  In  Fig.  12  in  the  preceding  chapter 
we  found  that  the  difference  between  the  best  and  worst  codes  was  approximately 
3.3  db.  for  slow  channels.  With  the  DSF  scheme  we  found  a difference  of  less 
than  1 db. 


4.4  Approximating  the  Probability  Density  of  the  I. I. 

We  have  found  the  DS  channel  to  be  a reasonable  approximation  to  the  first 
order  I. I.  model  with  a slow  response.  This  model  indicates  that  the 
probability  density  of  the  I. I.  is  multimodal.  It  is  interesting  to  note  that 
this  multimodal  property  appears  in  [2,  p.  82]  in  an  example  of  a channel  with 
an  exponential  frequency  response  with  raised  cosine  pulse  shaping  and  an 
independent  equiprobable  binary  channel  sequence.  The  density  in  this  case  was 
found  by  the  exhaustive  method  for  the  truncated  time  response. 


It  is  desirable  to  find  a good  approximation  for  the  I. I.  density  when  the 
parameter  X is  less  than  1.  It  is  well  known  that  assuming  the  I. I.  as 
Gaussian  results  in  a poor  approximation  to  the  error  probability.  The 
explanation  for  this  has  been  based  on  the  fact  that  for  any  realistic  channel 
the  impulse  response  decays  exponentially.  The  maximum  of  the  1. 1.,  therefore, 
is  finite  and  the  "tail"  of  its  probability  density  cannot  be  Gaussian  (see, 
e.g.,  [2]).  The  multimodal  character  of  the  I. I.  density,  however,  suggests  a 
stronger  argument  against  the  Gaussian  approximation.  When  the  variance  of  each 
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As  X approaches  unity,  the  conditional  density  function  becomes  narrower 
until  it  "collapses"  into  an  impulsive  density.  We  may  assume  this  density  to 
be  Gaussian  with  the  mean  value  equal  to  the  conditional  mean,  and  variance 
equal  to  the  conditional  variance.  This  approximation  is  convenient  because  the 
density  of  the  I. I.  plus  noise  is  also  Gaussian,  with  mean  and  variance  equal 
to  the  sum  of  the  noise  and  I. I.  means  and  variances,  respectively.  The 
conditional  error  probability  is  then 


(qj,j  //(2irr2)"te-(5  - uj)2/2rj  d£ 

” 00 

(qj,j+qJ,J-1)/(27rrj)"1e'(c " Uj)2/2rj  d? 

1/2 


(4.22) 
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where  T =(Nn/2)+a'r  is  the  conditional  variance  of  the  noise  plus  1. 1.  After 
: u J 

changing  the  variable  of  integration  and  averaging  over  the  marginal  events  we 
obtain  the  total  error  probability  as 


P(E)=  l P(a  = j){(qj)j+qj)j+1)Q(l_^i 
j = 1 


2Fi 

„ , 1 - 2u  . 

l<‘j.r<|j,j-i)Q( x 

2r . 

: 


(4.23) 


We  shall  also  use  this  approximation  for  the  DSF  equalization  scheme  discussed 

in  this  chapter.  In  this  case  the  combined  I. I.  and  noise  is  zero  mean 

2 

Gaussian  with  variance  I\  , and  the  error  probability  is 


The  accuracy  of  this  approximation  was  tested  on  the  codes  discussed.  It 
was  found  that  the  error  probabilities  resulting  from  the  approximation  agreed 
with  the  true  error  probabilities  to  two  significant  digits  when  the  effective 
channel  response  was  50  symbols  or  greater.  The  maximum  SNR  for  which  these 
comparisons  were  made  was  18  db.  The  greatest  discrepancy  between  the 

approximate  and  the  true  P(E)  occurred  with  the  MS43  code  when  the  effective 

response  was  only  5 symbols.  This  difference  is  approximately  1 db. 

A final  note  on  the  accuracy  of  the  Gaussian  approximation  for  the 

conditional  I. I.  densities  concerns  the  asymptotic  behavior  of  the  error 

probability  as  the  SNR  approaches  infinity.  The  actual  limiting  behavior  may  be 
determined  by  assuming  the  channel  to  be  noiseless.  In  this  case,  the  error 
probability  results  from  the  I. I.  only.  The  conditional  error  probability  is  a 
special  case  of  expression  (3-9).  We  find 

If  we  consider  the  DS  channel  model,  we  find  that  the  conditional  I. I. 
probabilities  are 


Ft  I a (-1/21 j)  = 


1;  ca  ( j)  < -1/2 


(4.26) 


[O;  otherwise 


and 
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0 
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Fj  I 0 ( 1/21 j) = 


1 ; ca  (j)  > 1/2 


0;  otherwise. 


(4.27) 


The  limiting  probability  will,  therefore,  be  zero  if  and  only  if  a <1/2c. 

max 

When  we  use  the  Gaussian  approximation  for  X < 1 , the  noiseless  error  probability 
will  always  be  non-zero.  However,  the  actual  limiting  noiseless  error 
probability  may  be  zero.  When  the  conditional  variances  are  small  ( X close  to 
unity),  this  difference  should  not  be  significant. 


In  this  section  we  shall  present  some  results  of  simulation  of  the  first 
order  I. I.  model.  The  simulation  method  is  a straightforward  model  of  the 
encoder  and  first  order  channel.  Because  the  input  binary  sequence  was 
generated  by  a uniform  random  number  generator,  this  sequence  consisted  of 
independent  equiprobable  random  variables.  The  initial  state  of  the  encoder  was 
selected  arbitrarily.  At  each  symbol  time,  the  DS  was  computed  and  the  I. I. 
value  was  stored  in  an  array  specified  by  the  value  of  the  DS.  At  the  end  of 
the  simulation,  the  I. I.  data  for  each  value  of  the  DS  were  sorted  and 
quantized  into  intervals  of  width  .001.  We  calculated  the  probability  that  the 
conditional  I. I.  value  lies  in  a given  interval  by  dividing  the  number  of 
occurrences  of  the  event  by  the  total  number  of  occurrences  of  the  appropriate 


DS  value. 


i 


First  we  shall  present  the  "density"  obtained  for  the  Bipolar  code,  when 
X=.5  and  c= . 1 . It  is  known  that  the  total  I. I.  density  in  this  case  is 


uniform  [8,26].  The  results  shown  are  from  a 90012  symbol  simulation:  the 

density  for  a=0  is  shown  below  in  Fig.  21,  and  the  density  for  a =1  is 
identical,  except  for  translation  on  the  horizontal  axis. 


Figure  21.  Conditional  I. I.  density  for  Bipolar 


The  conditional  density  shown  in  Fig.  21  is  a reasonable  approximation  to 
the  uniform  density.  Although  we  are  making  no  claims  about  the  statistical 
significance  of  the  simulation  results,  we  can  be  assured  that  the  densities  we 
obtain  are  a useful  estimate  of  the  actual  densities.  The  results  provide 
evidence  for  the  accuracy  of  the  Gaussian  approximation  for  the  conditional  I. I. 


densities . 
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Fig.  22  illustrates  some  of  the  conditonal  densities  found  for  the  PST 
code.  We  show  only  those  for  the  DS  equal  to  zero  since  each  conditional 
density  was  similar  in  shape.  The  channel  response  durations  we  show  are  for 
i = 10,  50,  and  100  symbols  ( X=. 501, .871,  and  .933),  and  the  constant  c in  each 
case  is  .1.  The  shapes  of  the  conditional  densities  for  the  slow  responses  are 
smooth  and  similar  to  the  Gaussian  density.  We  also  see  that  for  the  fast 
response,  the  density  is  not  smooth.  It  is  reasonable  to  assume  that  the  I. I. 
in  this  case  is  a discrete  random  variable.  This  illustrates  the  difficulty  of 
attempting  to  determine  the  densities  analytically.  In  Fig.  23,  we  show  the 
conditional  densities  for  the  MS43  code  for  the  same  parameters.  The  shapes  are 
similar  to  those  of  the  PST.  This  was  found  to  be  true  for  the  other  codes  as 
well.  The  densities  for  the  MS43  are  not  as  smooth  as  those  of  the  PST, 
however,  which  is  probably  a result  of  fewer  data  points  in  the  simulation.  The 
significance  of  the  "spikes"  at  zero  in  the  MS43  densities  is  also  difficult  to 


determine. 


ao 

(a)  10  symbol  duration 


(b)  50  symbol  duration 


(c)  100  symbol  duration 


Figure  22.  Conditional  densities  of  PST  for  various  response  durations 
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CHAPTER  5 


CODE  DESIGN  FOR  I. I.  CONTROL 


Remarks 


The  analytical  tools  developed  thus  far  allow  us  to  investigate  the  impact 
on  error  performance  the  designer  has  through  the  judicious  selection  of  CE 
codebooks.  As  noted  in  the  introduction  (Chapter  1),  techniques  for  minimizing 
the  effects  of  I. I.  have  been  studied  from  many  viewpoints;  providing  many 
pulse  shaping  schemes,  equalization  (fixed  and  adaptive),  <.  nd  coding  techniques. 
Many  of  the  coding  techniques  available  are  in  the  class  of  counter-encodabi c 
codes.  These  include  the  Bipolar,  PST,  MS43,  partial  response,  and  4B-3T  [27l 

The  class  of  filled  bipolar  codes  [e.g.  CHDBn  [22],  and  BnZs  [25]),  are  also  CE 
codes,  however,  with  variable  length  codebooks.  This  leads  us  to  the  problem  of 
selecting  codewords  for  a CE  code  so  that  the  statistics  of  the  channel  sequence 
produce  good  error  performance.  In  this  chapter,  we  will  study  the  analytical 
results  we  have  obtained  for  the  first  order  I. I.  model  in  order  to  extract 
some  insight  into  the  synthesis  problem.  We  will  study  the  case  where  the 
detector  is  a threshold  slicer  for  which  the  multimodal  nature  of  the  I. I. 
density  is  the  dominant  characteristic,  and  the  case  of  the  DSF  equalization  for 
which  the  consequences  of  the  multimodal  nature  of  the  I. I.  density  have  been 
removed.  First,  we  discuss  the  preliminary  aspects  of  designing  CE  codes. 


5.2  Constraining  EactLOT 3 IflC.  CEL  £Q<tebQ.Pks 


Before  we  may  concentrate  on  choosing  the  set  of  codewords  which  have  the 
most  desirable  I. I.  properties,  it  is  necessary  to  determine  the  constraints 
which  the  structure  of  CE  codes  place  on  these  sets.  Recall  that  a CE  code  is 
identified  as  an  (m,n,r)  code  where  n is  the  codeword  length  (ternary),  m is  the 
blocklength  of  the  binary  information  sequence  and  r is  the  number  of  states  in 
the  encoder.  The  first  constraint  follows  directly  from  the  definition  of  the 
state  transitions  for  a CE  code.  Recall  that  the  state  transition  function  is 

a(st+1)=  ff(at)+  Y(at)  (5.1) 

where  Y(sl^)  is  the  algebraic  sum  of  the  codeletters  of  the  codeword  at  wordtime 
t.  Since  the  set  of  states  is  finite  and  0 is  a 1-1  function  there  is  a least 
integer  0(i)  for  some  state  i.  When  the  encoder  is  in  this  state  i,  we  have 
the  restriction 

Y(a)>0  (5.2) 

for  any  a which  is  in  state  i.  We  refer  to  the  codebock  for  this  state  as  the 
positive  reflecting  mapping  for  the  CE  code.  A similar  argument  demonstrates 
that  there  is  another  state,  say  j,  such  that  Y(a)-^  0 for  every  a in  state  j. 
The  codebook  for  this  state  is  called  the  negative  reflecting  mapping.  Since 
the  set  of  states  S is  denoted  by  { 1 , 2 , . . . , r} , we  let  state  1 be  associated 
with  the  positive  reflecting  mapping  and  state  r with  the  negative  reflecting 
mapping.  In  this  case  we  are  assuming  that  the  function  o is  a strictly 
increasing  function  of  the  integers.  For  the  majority  of  situations  we 
consider,  the  function  0 is  °(i)=i.  The  existence  of  the  positive  reflecting 
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mapping  and  the  necessity  of  providing  2m  codewords  in  each  state  leads  to  a 
necessary  condition  for  the  existence  of  an  (m,n,r)  code.  Let  Cj  denote  the 
number  of  ternary  n-tuples  which  have  characteristic  j,  then 


l Cj»2m 

j=0 


(5.3) 


is  this  necessary  condition.  To  demonstrate  the  sufficiency  of  (5-3) , we  must 
enumerate  the  Cj's.  Toward  this  end,  note  that  the  concatenation  of  two 
codewords  having  lengths  n1  and  n2>  and  characteristics  and  Y2 

respectively,  yields  a codeword  having  length  n^n2  and  characteristic  Y^  Y2 . 
This  suggests  the  generating  function  (x^  + l+x1  )n  as  the  characteristic 
enumerator,  with  the  exponents  indicating  the  characteristics  and  the 
coefficients  being  the  number  of  words.  Another  restriction  we  place  on  the 
codebook  is  that  the  n-tuple  of  all  zeroes  be  ineligible.  The  motivation  for 
this  restriction  lies  in  the  engineering  of  the  regenerative  repeaters.  These 
repeaters  derive  the  timing  information  for  the  sampling  operation  from  the 
received  sequence.  If  a long  sequence  of  0's  occurs,  there  would  be  no  timing 
information  during  this  interval,  and  the  sampler  could  lose  symbol 
synchronization.  Thus  we  must  subtract  1 from  the  constant  term  in  the 
enumerator,  after  which  we  obtain 


(x_1  + 1 + x)n-1=  l CjX^. 


(5.4) 


The  symmetry  of  the  enumerator  implies  Cj=C_j  for  every  j=1,...,n.  Furthermore, 
ignoring  the  -1  term  for  a moment,  it  is  clear  that  Cj*  if  k.  Including 
the  term  -1  forces  the  inequality  to  be  weakened  to 
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Cj^Ck  for  |j|<|k|.  (5.5) 

Now,  let  us  consider  the  encoding  mode  (mapping)  for  state  k,  k<  r.  If  a is 
a codeword  in  this  mode,  the  state  transition  function  requires 

1-k  ^ y (a)  < r-k.  (5.6) 

Let  Ak  denote  the  set  of  codewords  satisfying  (5.6),  which  we  shall  refer  to  as 
the  set  of  k-eligible  codewords  (e.g.,  the  set  of  codewords  satisfying  (5.2)  is 
the  set  of  1-eligible  codewords,  A^).  From  the  definition  of  the  set  Ak,  we  see 
that 


r-k 

|Ak|=  l Cy  (5.7) 

1-k 

Inequality  (5.5)  ensures  that  J Ak | 5-IaJ  , therefore  the  condition  (5.3)  which 
is  also  equivalent  to  |A1|$2m,  is  a necessary  and  sufficient  condition  for  the 
existence  of  an  (m,n,r)  CE  code.  Table  5 shows  the  results  of  this  enumeration 
for  blocklengths  up  to  seven. 

Table  5.  Enumerator  Coefficients  for  n £ 7 


Y 

0 

1 

2 

3 

4 

5 

6 7 

n 

1 

1 

1 

2 

2 

2 

1 

3 

6 

6 

3 

1 

4 

13 

16 

10 

4 

1 

5 

50 

45 

30 

15 

5 

1 

6 

140 

126 

90 

50 

21 

6 

1 

7 

392 

357 

266 

161 

77 

28 

7 1 

, 
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Obviously,  for  a fixed  codeword  length,  n,  it  is  most  desirable  to  maximize 
the  binary  blocklength,  m.  From  the  condition  (5.3) , it  is  a simple  matter  to 
find  the  maximum  value  of  m for  a given  codelength.  In  addition,  from  the 
standpoint  of  hardware  complexity,  it  is  desirable  to  minimize  the  number  of 
states,  r.  For  any  pair  (m,n)  the  minimum  number  of  states  is  the  smallest  r* 
for  which 


I |Aj|>  2m. 

j=0 


(5.8) 


With  the  aid  of  Table  5,  the  code  parameters  yielding  the  greatest  rate 
increases  for  codeword  lengths  up  to  seven  have  been  determined  with  the  minimum 
number  of  states  necessary  for  a CE  realization.  Table  6 provides  tb ' 
information . 


Table  6.  Code  parameters  realizing  greatest  rate  increase 


Rate  Increase 
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The  code  design  problem  is  simplified  by  considering  a sub-class  of  the 
general  CE  codes,  which  we  refer  to  as  balanced  CE  codes.  A few  conventions 
will  oe  useful  prior  to  the  definition  of  these  codes.  A codeword  a,  is  the 
complement  of  a.  if  -a=a  where  the  minus  sign  indicates  the  integer 
multiplication  of  each  code  digit  by  -1  (e.g.,  the  complement  of  +0-  is  -0+). 
Moreover,  the  encoding  mode,  f^,  for  some  state  i is  balanced  if 
a ef^++  a e f^.  Finally,  two  encoding  modes,  fi  and  f j , are  complementary 
modes  if  ae  f^  -*■  a e f j . A CE  code  is  defined  as  a balanced  code  if  every  mode 
is  either  balanced  or  the  complement  of  another  mode;  furthermore,  if  fi  is 
unbalanced  then  fr_£+^  is  its  complementary  mode. 


All  of  the  CE  codes  which  have  been  discussed  in  this  thesis  are  balanced 
codes.  Referring  to  the  codebook  for  the  PST  code  in  Chapter  1 (Table  1),  it  J3 
readily  seen  that  the  two  states  are  complementary.  The  synthesis  of  balanced 
codes  is  somewhat  easier  than  that  of  general  CE  codes  because  only  half  of  the 
codewords  must  be  selected.  For  example,  if  a 2-state  code  is  designed,  the  two 
states  must  be  complementary.  Once  the  codewords  for  one  state  are  chosen,  the 
codewords  for  the  other  state  are  the  complements. 

The  analysis  of  the  DS  process  is  also  simplified  for  balanced  codes.  For 
every  occurrence  of  a=i,  i=1,...,d,  in  the  DS  table,  there  is  an  occurrence  of 
o=d-i+1  which  implies  P(o  =i)=P(a  =d-i+1).  Because  the  expected  value  oi  the 
DS  is  zero,  the  values  of  the  DS  are  o (i)=i-(d+1 )/2;  that  is,  the  I.I.  for 
the  DS  channel  is  symmetric  about  the  origin.  These  considerations  will  be 
useful  in  the  next  section  when  we  analyze  the  error  probabilities  for  CE  codes 
and  develop  some  ideas  concerning  their  synthesis. 
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There  is  another  desirable  property  of  CE  codes  to  consider  in  this 
section.  This  property  is  called  State  Independent  Decodabilitv  (SID).  A CE 
code  is  SID  if  the  decoder  is  able  to  map  the  received  ternary  n-tuple  into  the 

< 

correct  binary  m-tuple  without  knowledge  of  the  encoder's  state.  Of  course  this 

assumes  that  no  transmission  errors  have  occurred.  Referring  to  the  codebook, 

this  definition  is  equivalent  to  saying  that  a ternary  n-tuple  may  appear  in  at 

most  one  row  for  an  SID  code.  A sufficient  (but  not  necessary)  condition  for 

state  independent  decodability  is  (see  [20],  p.  147): 

for  any  two  states  u,v  such  that  1 < u<  v4  r,  and 
a codeword 

aeuflv  -*■  i eu+1.  (5.9) 

In  the  next  chapter,  we  shall  discuss  the  practical  consequences  of  decoding 
state-dependent  or  state-independent  decodable  codes.  For  the  remainder  of  th’S 
chapter,  we  shall  incorporate  the  SID  property  into  the  code  synthesis  problem 
for  I. I.  control. 

— * 1 

The  existence  arguments  presented  in  this  section  define  the  sets  of 
codewords  which  are  eligible  for  insertion  into  a CE  code.  Any  synthesis 
algorithm  thus  begins  by  partitioning  the  set  of  3n-1  ternary  n-tuples  according 
to  characteristic.  Then  the  sets  of  k-eligible  codewords  may  be  formed  for 
k=1 r.  The  codebook  may  be  filled  by  selecting  n-tuples  from  these  sets  and 

I 

inserting  them  into  the  appropriate  mappings.  Since  the  codes  of  interest  are 

1 n 

non-linear,  we  are  free  to  choose  the  codewords  by  any  rule  we  desire.  The 
remainder  of  this  chapter  is  concerned  with  developing  selection  rules  based  on 
the  error  performance  of  channel  sequences  for  the  DS  channel. 
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5.3  Coding  for  Threshold  Detection 

In  this  section,  we  will  use  the  DS  channel  model  as  the  basis  for 
determining  the  criteria  to  guide  the  code  construction  procedure.  In  the  case 
that  the  detector  is  a fixed  threshold  device,  we  have  found  that  the  error 
probability  is  closely  related  to  the  multimodal  nature  of  the  I. I.  probability 
density.  From  expression  (4.13)  we  see  that  the  P(E)  is  a function  of  the  DS 
transition  probabilities,  the  set  of  DS  values,  and  the  distortion  coefficient, 
c,  of  the  channel.  Of  these  factors,  the  distortion  coefficient  is  out  of  the 
code  designers'  control.  However,  some  knowledge  of  this  level  is  very  useful, 
which  we  will  demonstrate  in  this  section.  We  shall  begin  by  considering  the 
limiting  case  of  a noiseless  system.  This  is  a reasonable  point  of  view  because 
the  systems  of  interest  operate  at  a high  SNR.  Furthermore,  the  noiseless 
system  provides  insight  into  the  noisy  case.  Let  us  define  two  sets  as  follows: 


D"  = {j  I o(jX-l/2c}, 
D+  = { j | c(j)s.l/2c}. 


(5.10) 


In  terns  of  these  sets,  we  may  express  the  noiseless  error  probability  (denoted 
PJE))  from  (4.25),  (4.26),  and  (4.27)  as 

P(asj)  (Qj , j+qj , j+i )+  l PCd=J)(qj,  j+cl  j , j-1  > • (5.11) 

jeD-  jeD+ 

In  order  to  demonstrate  the  dependence  of  P (E)  on  the  distortion  level,  we 


have  plotted  the  limiting  error  probability  in  Fig.  24  for  the  PST  code. 
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Figure  24.  Noiseless  error  probability  for  PST 

We  see  from  Fig.  24  that  when  0<c<  1/3,  the  noiseless  error  probability  i. 
zero.  However,  P^CE)  jumps  to  .125  at  c=1/3,  and  this  error  probability  is 
unacceptable  for  any  application.  The  values  of  c for  which  P (E)  is 
discontinuous  are  a function  of  the  DSV  because  the  first  point  of  discontinuity 
occurs  when  the  DS  is  maximum.  If  we  had  no  estimate  for  the  distortion  level 
for  a particular  application,  it  would  be  necessary  to  design  a code  with 
minimal  DSV.  However,  this  could  be  overly  conservative  because  there  is  a 
penalty  for  minimizing  the  DSV.  That  is,  when  we  design  a code  for  minimal  DSV, 

[ 

we  must  eliminate  many  codewords  from  consideration,  and  the  rate  (or 
efficiency)  of  the  code  is  reduced.  One  design  objective  is  the  selection  of 
the  most  efficient  code  for  any  application  (i.e.,  one  which  maximizes  m/n). 
However,  as  we  have  seen  from  Fig.  24,  the  magnitude  of  the  distortion  should  be 
estimated  first  in  order  to  determine  an  acceptable  DSV. 


Another  aspect  of  the  noiseless  error  probability  is  apparent  from  Fig.  24. 
That  is,  P^CE)  is  not  symmetric  about  c=0.  The  error  probability  for  negative  c 
is  less  than  that  for  positive  c,  and  the  range  of  c for  which  PW(E)  is  zero  is 


greater.  We  have  noticed  this  before  when  the  true  error  probabilities  were 
calculated  for  the  first  order  I. I.  model,  and  again  in  the  preceding  chapter 
when  the  DSV  was  introduced.  This  asymmetry  is  easy  to  explain  by  assuming  that 
1/(d-1)<  o<  1/(d-2)  whereby  P^CE)  is  nonzero  for  a code  with  d DS  values,  with 
the  errors  occurring  only  for  the  extreme  DS  values,  o(1)  ana  o(d).  When  c is 
positive,  we  have  D“={1},  and  D+={d}  which  yields 


Poo(E)=P(c  = 1)(q1>1+q1>2)+P(a=d)(qd)d+qd)d_1).  (5.12) 

When  c is  negative  in  the  interval  ( — 1 / (d— 2 ) , — 1 / (d— 1 ) ) , however,  we  find  D“={d} 
and  D+={ 1 } , whereby  we  obtain 

PjE)  = P(a  = d)(qd(d+qd(d+1)  ♦ P(a  = 1 ) ^ , ,+q,  >Q) . (5.13) 

But  because  a=1  and  a=d  are  extreme  states,  qd  d+^=q^  Q=0,  and  expression 
(5.13)  becomes 

P00(E)  = P(a=d)  qdjd+P(a  = 1)q1fl.  (5-14) 

We  see  that  if  qd  d=q1  ^=0 , the  noiseless  error  probability  is  zero.  This  is 
the  property  we  mentioned  in  the  preceding  chapter  concerning  the  (8,6,3)  code 
and  the  B6ZS  code.  From  Fig.  24,  it  is  apparent  that  the  PST  code  also  has  this 
property.  This  demonstrates  the  weakness  of  using  the  DSV  criterion  as  the  only 
evidence  in  judging  a code's  performance. 

For  the  case  when  1/(d— 1 ) < c<  1 / ( d— 2 ) , the  expression  for  P^tE)  may  be 
simplified  by  noting  for  ternary  codes;  q^  ^+q.|  2=Qd  d+<5d  d-1  = 1,  Thus,  when 
only  the  extreme  states  are  the  cause  of  noiseless  errors,  the  error  probability 


This  assumes  the  input  binary  blocks  are  equiprobable . Since  we  only  need  a 
rough  estimate  of  (5.15),  we  can  assume  that  the  stationary  probability  of  the 
encoder  state  is  unity,  and  that  n=m.  Thus  an  estimate  of  the  least  value  of 
(5.15)  is  ( 1/n)2“^n_1 ^ . Using  10“^  as  the  maximum  acceptable  error  probability, 
we  find  the  required  minimum  codeword  length  is  14,  which  is  too  long  for 
practical  purposes.  Therefore,  for  the  case  of  the  noiseless  channel,  we  may 
assume  that  if  P«,(E)  is  not  zero,  then  it  is  not  acceptable  on  the  DS  channel. 
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the  noisy  DS  channel.  We  begin  by  considering  the  contribution  to  the  error 
probability  for  a given  DS  value,  which  is  P(o  = j)P(E|  j). 


The  first  step  is  to  determine  the  values  of  the  DS  which  contribute  the 
most  to  the  error  probability.  In  particular,  we  want  to  determine  if  the 
extreme  values  always  contribute  the  most  for  a particular  code  (this  is  the 
t xcit  assumption  of  the  DSV  criterion).  We  need,  therefore,  a method  to  perform 
this  comparison.  A simplification  which  will  not  cause  much  •'ccuracy  in  this 
analysis  is  to  consider  qj  j+qj  j+i=<?j  j+Qj  j-1  = 1-  RecaH  that  when  q1  ^=qd  d=0 
and  c is  negative,  the  extreme  sums  do  not  contribute  to  the  error  probability. 
This  is  not  a problem  for  this  analysis  because  we  may  equivalently  consider 
such  a code  as  having  equal  to  (d-3)/2  instead  of  (d-1)/2  when  c is 
negative.  With  this  simplification,  we  note  that  the  contribution  to  P(E)  for 
two  DS  values,  j and  k,  will  be  equal  if 
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We  may  therefore  consider  only  the  dominant  term  in  each  side  of  (5.17)  because 
the  maximum  error  we  could  make  is  a factor  of  two  which  is  not  significant. 
Moreover,  we  may  assume  c is  positive  without  loss  of  generality.  These 
considerations  allow  us  to  rewrite  (5.17)  as 


P(a=j)Q 


(5.18) 


A useful  tool  which  facilitates  this  analysis  is  a very  tight  upper  bound  for 
the  Q-function  [31],  which  is 


Q(x)  « { V /2tt  x)  e"x2/  2 . 


(5.19) 


Upon  substituting  (5.19)  into  (5.18)  and  rearranging  the  expression,  we  find 
that  two  DS  values  will  contribute  equally  to  the  error  probability  if 

P(a=j)  1-2cc(j)  e-(V4N0)  [(1  - 2co  ( j) ) 2 - (4-  2ca(k))2] 

“ ’ (5.20) 

P(o=k)  1 -2ca (k) 


We  may  use  expression  (5.20)  to  show  that  even  though  the  (8,6,3)  code  has  a DSV 
of  one  greater  than  the  MS43,  the  extreme  values  of  the  DS  of  the  (8,6,3)  do  not 
contribute  as  much  to  the  error  probability  as  the  extreme  values  of  the  MS43 
for  a broad  range  of  distortion  coefficients  and  signal-to-noise  ratios.  Upon 
substituting  o(j)=5/2  for  MS43  and  c(k)=3  for  the  (8,6,3)  into  (5-20),  we  find 
that  these  two  DS  values  would  contribute  equally  to  the  error  probability  if 


P(a-6)  ^ 1 - 5c  e“9c(c-2/9)/4N0 

P(o=7)  1 - 6c 


(5.21) 


Expression  (5.21)  indicates  that  the  comparison  of  the  two  codes  is  sensitive  to 
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both  the  channel  distortion  and  the  SNR.  Because  we  have  consistently  used  c=.1 
in  our  examples  we  shall  perform  the  comparison  for  this  value.  The  appropriate 
probabilities  for  the  two  codes  are:  .000371  for  (8,6,3)  and  .0186  for  MS43- 
The  substitution  of  these  values  into  (5-21)  yields 


.0186 

.000371 


:1.25e.0275/N0, 


and  solving  for  Nq,  we  find  that  the  extreme  value  of  the  DS  for  the  (8,6,3) 
code  will  contribute  more  to  the  error  probability  than  that  of  the  MS43  only 
when  the  SNR  is  greater  than  21.3  db. 

This  analysis  demonstrates  the  effect  of  the  DS  probability  distribution  on 
the  noisy  DS  channel.  Although  in  the  limit  of  high  SNR  (or  severe  distortion), 
a code  with  a greater  DSV  will  produce  a greater  error  probability  than  one  with 
a lesser  DSV,  for  a specific  application  a small  probability  of  the  extreme  DS 
value  in  the  code  with  larger  DSV  may  be  sufficient  to  reduce  the  conditional 
error  probability  of  t.ie  extreme  value.  Furthermore,  even  if  the  greater  DSV 
priduces  a greater  error  probability,  it  may  be  acceptable  for  proper  system 
performance.  A similar  comparison  between  the  two  extreme  DS  values  (a  =1  and 
a=2)  of  the  (8,6,3)  code  reveals  that  the  extreme  DS  value  produces  less  error 
probability  than  its  adjacent  DS  value  whenever  the  SNR  is  less  than  47  db  (for 
c= . 1 ) . For  the  MS43  code,  however,  this  cutoff  is  only  14.7  db. 

We  will  now  apply  the  ideas  discussed  in  this  section  to  the  design  of  a 
(7,5,4)  code.  From  Table  6,  we  see  that  the  minimum  number  of  states  for  such  a 
CE  code  is  four.  The  first  step  in  any  CE  code  construction  is  the  construction 
of  the  sets  of  k-eligible  words,  where  k=1,2,3,4  in  this  case.  Although  we  have 
demonstrated  that  the  DSV  is  not  in  itself  an  infallible  criterion,  we  will  use 


it  for  a first  iteration  in  this  procedure.  We  will  want  to  determine  the 
minimum  DSV  possible  for  a (7»5,4)  code,  which  is  obviously  greater  than  or 
equal  to  four  because  each  state  corresponds  to  a different  DS  value  in  the 
first  digit  of  each  codeword.  Moreover,  if  we  determine  the  minimum  DS  value  in 
the  positive  reflecting  mapping,  and  design  a balanced  code,  the  maximum  value 
(hence  the  DSV)  will  be  determined.  To  find  the  minimum  value  of  the  DS,  the 
set  of  1-eligible  words,  Aj , is  transformed  into  an  equivalent  set  of  DS 
sequences  of  length  7.  These  sets  are  defined  as 


— { (k , , ^2 , . . • , ^ )|  | i=2 , . . • ,7y  Ajj} . 


(5.22) 


Note  that  it  is  not  necessary  to  include  the  additive  constant,  C , in  the 
construction  because  we  are  interested  in  the  number  of  DS  values,  d,  and  not 
the  actual  values  of  the  DS.  From  the  set  of  1-eligible  words  (in  which  there 
are  140  words),  we  must  choose  2^=128  words  such  that  the  smallest  value  of 
ai  » i*1,...,7»  is  as  large  as  possible.  After  this  operation,  we  find  that 
the  minimum  value  of  the  DS  in  state  one  is  zero.  Because  state  4 is  the 
complement  of  state  1,  the  maximum  value  is  5;  and  therefore,  dr6.  Now  that 
the  minimum  DSV  has  been  established,  we  eliminate  all  code  words  in  A-|  and  A2 
for  which  there  is  an  occurrence  of  a DS  value  less  than  zero  in  D1  and  D2, 
respectively.  We  have  reduced  the  number  of  eligible  words  for  the  codebook  in 
order  to  ensure  d=6.  For  the  next  iteration  in  the  design  process,  we  will 
choose  codewords  for  state  1 and  state  2 in  order  to  minimize  the  probability  of 
the  extreme  values  of  the  DS.  Because  the  code  is  balanced,  both  extreme  values 
contribute  equally  to  the  error  probability. 


u 

L-.-i 

'Ll 

u 

0 


u 

: ] 
L — J 

u 

Q 

0 

0 

& 

0 

D 

D 

n 


91 

In  Chapter  1 , we  indicated  that  the  probability  of  a DS  value  is  the 
average  of  the  conditional  DS  probabilities  over  the  set  of  encoder  states.  The 
conditional  probability  for  state  i is 

2m  n 

P(c=j|state  i)=(1/n)Pl  I 9 Z (5.23) 

1 u 1U 

u=1  q=1 

where  p^  is  the  stationary  probability  of  the  encoder  state  i,  0U  is  the 
probability  of  the  binary  input  block,  bu,  and 

n 

l ♦?<*)(J) 

, iu 
q=1 

is  the  number  of  occurrences  of  the  DS  value  j in  the  codeword  in  state  i 
corresponding  to  input  block  bu.  In  the  code  synthesis  problem  we  will  assume 
that  each  binary  input  block  is  equiprobable  (i.e.,  9u=2“m) , and  consider  the 

conditional  DS  probability  as 


(1/n)2-mPi  «i(J) 


where  *^(j)  is  the  number  of  occurrences  of  the  DS  value  j in  state  i.  In  the 
case  of  the  (7,5,4)  code  that  we  are  constructing,  ®i(J)=  (d-J+1)  and 
Pi=P4_i+1 , because  the  code  is  balanced.  The  synthesis  problem  is  the  selection 
of  the  codewords  for  state  1 and  state  2 such  that  p^^  ^(1)  is  minimized  for 
i=1,2  and  U (0,5).  We  will  proceed  heuristically  with  this  minimization. 

First,  we  note  that  there  will  be  more  occurrences  of  the  zero  DS  value  ir. 
state  1 than  state  2 because  the  DS  sequence  for  a codeword  in  state  1 is  one 
less  in  every  digit  than  the  DS  sequence  for  the  same  word  in  state  2 (see 
(5.22)).  It  is  desirable,  therefore,  to  minimize  p , which  is  the  first 


component  of  the  vector  £ which  solves 


fin  =J2 


where  n is  the  state  transition  matrix  for  the  encoder.  In  order  to  minimize 
P1 , we  shall  choose  codewords  which  minimize  the  encoder  state  transition 
probabilities  it j 1 , j=1 ,2  which  is  equivalent  to  choosing  as  many  words  as 
possible  with  characteristics  equal  to  one  and  two  in  state  1 . Upon  checking 
the  set  of  1-eligible  words,  we  find  44  potential  codewords  having 
characteristic  equal  to  one  and  30  words  having  characteristic  equal  to  two. 
Because  we  need  128  words  to  complete  the  encoding  mode  for  state  1,  we  still 
need  to  choose  54  words,  all  of  which  will  have  characteristic  equal  to  zero  or 
three.  We  choose  these  54  words  from  the  remaining  1-eligible  words  according 
to  the  number  of  zero  (and  five)  DS  values  they  contain  in  order  to  keep  the 
number  of  occurrences  of  zeroes  and  fives  as  small  as  possible.  We  find  33  such 
words  which  contain  no  zeroes,  and  14  which  contain  one  zero.  We  choose  the 
remaining  seven  words  from  those  which  contain  two  zeroes.  We  note  that  in  this 
case,  it  is  impossible  to  fill  the  encoding  mode  such  that  the  DS  transition 
probability,  q^,  is  zero.  However,  we  have  chosen  the  words  in  a manner  which 
minimizes  this  probabiity. 

In  order  to  fill  the  encoding  mode  for  state  2,  we  refer  to  the  set  of 
2-eligible  words  and  its  associated  set  of  DS  sequence  blocks,  Dg.  Our 
selection  rule  to  minimize  the  encoder  state  transitions  to  state  1 (and  state 
4)  dictates  the  selection  of  all  eligible  words  having  characteristic  equal  to 
zero  and  one,  of  which  there  are  95  such  words.  This  leaves  33  words  to  be 
chosen  from  the  eligible  codewords  having  characteristic  equal  to  -1  and  2.  We 
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will  select  these  words  according  to  the  number  of  occurrences  of  extreme  DS 
values.  In  this  case,  it  is  easier  to  maximize  the  number  of  occurrences  of  the 
DS  values  2 and  3 which  are  the  central  DS  values.  There  are  18  remaining 
eligible  words  which  contain  only  DS  values  2 and  3.  We  choose  the  last  15 
words  from  the  set  of  remaining  2-eligible  words  containing  only  one  occurrence 
of  the  DS  values  1 and  4. 

This  completes  the  design  of  a (7,5,4)  code  for  the  DS  channel.  We  note 
that  the  resulting  code  is  state  independent  decodable.  We  exhibit  the 
codewords  in  states  1 and  2 for  this  code  in  Table  7.  The  error  performance  of 
this  code  on  the  DS  channel  is  slightly  better  than  that  of  MS43,  the  difference 
being  of  minor  consequence.  However  the  (7,5,4)  code  provides  a greater  rate 
increase  (m/n=1.4)  than  the  MS43  (m/n=1.33)>  It  is  interesting  to  determinine 
whether  a better  error  performance  is  possible  for  a 7/5  code  if  we  increase  the 
DSV  by  one  (d=7),  which  is  the  same  as  that  of  the  (8,6,3)  code.  To  accomplish 
this,  we  must  design  a (7,5,5)  code  because  the  minimum  increase  in  the  DSV  for 
a balanced  4 state  code  is  two.  The  positive  reflecting  mapping  for  the  (7,5,5) 
code  will  be  identical  to  that  of  the  (7,5,4)  code.  The  encoding  mode  for  state 
2 is  only  a slight  modification  of  the  corresponding  mode  in  (7,5,4).  In  the  5 
state  code,  we  may  use  all  of  the  words  having  characteristics  equal  to  two  in 
state  2 because  state  4 is  not  equivalent  to  state  1 (as  it  was  for  the  4 state 
code).  This  means  that  we  will  use  the  11  words  having  characteristic  equal  to 
two  which  were  not  used  in  the  4 state  code.  We  replace  the  11  words  which 
contain  the  largest  number  of  extreme  DS  values  having  characteristic  equal  to 
-1  in  state  2.  The  new  state  in  the  enocoder  becomes  state  3,  and  we  choose 
codewords  for  this  state  such  that  the  state  transition  probabilities  n are 
maximized;  that  is,  all  of  the  3-eligible  words  having  characteristic  equal  to 
zero  are  used.  The  remaining  78  words  are  chosen  to  have  characteristic  equal 
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Codewords  for  state  2 
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to  1 . Because  the  code  is  balanced,  state  3 (which  is  the  central  state  in 
this  case)  must  be  a balanced  mode  which  requires  choosing  the  codewords  having 
characteristic  equal  to  1 in  complementary  pairs.  Prior  to  actually 
constructing  the  codebook,  we  may  perform  the  comparative  analysis  developed  in 
this  section  in  order  to  determine  whether  we  will  gain  an  improvement  in  error 
performance  by  increasing  the  DSV.  Because  the  distribution  of  the  codewords  by 
characteristic  in  each  state  is  known,  the  stationary  probabilities  of  the 
encoder  states  may  be  calculated.  An  estimate  of  the  probability  of  the  extreme 
DS  values  may  be  made  based  on  the  number  of  occurrences  of  the  0 DS  value  in 
states  1 and  2 of  the  (7,5,4)  code.  When  this  analysis  was  made,  we  found  that 
for  signal-to-noise  ratios  greater  than  16  db.,  the  extreme  value  for  the  5 
state  code  contributes  more  error  probability  than  that  of  the  4 state  code  (for 
c=.1).  Thus,  in  this  case,  we  have  nothing  to  gain  from  the  more  complex  code. 


5.4  CQSUnft  £2r 


When  the  Digital  Sum  Feedback  equalization  scheme  discussed  in  Chapter  4 is 
used,  the  multimodal  nature  of  the  I. I.  density  is  not  a factor  in  the  error 
probability.  For  channels  with  a slow  response,  we  have  seen  that  the  error 
probability  is  only  slightly  greater  than  that  of  the  channel  with  no  I. I.  The 
slight  increase  which  does  occur  is  a result  of  the  conditional  variances  of  the 
I. I.  In  section  4.4,  we  demonstrated  that  the  conditional  variances  are 
strongly  dependent  on  the  response  duration  of  the  channel  (viz.,  the  variances 
increase  as  the  duration  decreases).  We  do  not  expect  that  the  conditional 
variances  will  be  significantly  altered  through  the  code  design  process. 
However,  we  may  define  a practical  coding  problem  for  this  system  by  considering 
the  operation  of  the  DS  correction  device  in  the  feedback  loop. 
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We  have  mentioned  that  a single  error  in  the  estimated  sequence  will 
produce  an  error  of  magnitude  c in  the  feedback  loop  of  the  equalizer.  We  know 
moreover  that  this  error  will  be  corrected  when  the  channel  sequence  reaches  the 
terminal  DS  value.  The  coding  problem  for  this  equalization  scheme  is  thus 
defined  to  be  the  selection  of  a codebook  which  minimizes  the  average  time  until 
a terminal  DS  value  is  reached  by  the  encoder.  The  first  objective  of  this  code 
design  problem  has  been  mentioned;  that  is,  the  construction  procedure  should 
minimize  the  number  of  DS  values,  d.  Because  the  errors  are  effectively 
independent  of  the  channel  sequence  statistics,  we  may  assume  that  an  error  will 
occur  at  each  DS  value  with  equal  probability.  In  order  to  minimize  the  time  to 
correction,  it  is  clear  that  we  must  maximize  the  probability  of  the  extreme  DS 
values. 

We  will  conclude  this  section  with  an  illustration  of  the  code  design 
process  for  a (7*5,4)  code.  The  procedure  for  designing  the  codebook  is  similar' 
to  the  one  illustrated  in  section  5.3,  however,  the  objective  is  the  opposite. 
We  begin  with  the  formation  of  the  sets  of  eligible  words,  A1  and  Ag.  Recall 
that  the  minimum  value  of  d is  6,  and  we  eliminate  all  codewords  in  A1  and  A2 
for  which  there  is  an  occurrence  of  a DS  value  less  than  zero.  In  this  case,  we 
want  to  maximize  the  number  of  occurrences  of  the  0 and  5 DS  values,  therefore 
we  should  choose  all  of  the  1-eligible  words  which  have  characteristic  equal  to 
zero  and  three  in  order  to  maximize  the  stationary  probabilities  of  the  extreme 
encoder  states  (1  and  4).  Upon  checking  the  set  A1f  we  find  60  such  codewords, 
which  leaves  68  codewords  to  be  chosen  for  this  encoding  mode.  In  the  next 
pass,  we  choose  words  containing  DS  values  of  0's  and  5's  from  the  remaining 
1-eligible  words  whereby  we  find  19  more  codewords.  The  final  49  words  are 
selected  from  the  remaining  1-eligible  words  according  to  the  number  of  DS 
values  1 and  4 which  occur  (viz.,  those  words  which  maximize  the  number  of 
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occurrences  of  these  values). 

When  choosing  codewords  for  state  2,  we  wish  to  maximize  the  number  of 
encoder  state  transitions  to  the  extreme  states,  1 and  4.  This  indicates  that 
all  codewords  having  characteristics  equal  to  -1  and  2 should  be  chosen. 
Unfortunately,  we  may  not  blindly  fill  this  mode  with  these  codewords  because 
the  final  codebook  will  not  be  state  independent  decodable.  We  have  used  all 
except  five  codewords  having  characteristic  equal  to  0 in  state  1 . Because  the 
code  is  balanced,  all  codewords  except  5 having  characteristic  equal  to  zero 
will  be  elements  of  state  4 (the  five  exceptions  being  the  complements  of  the 
unused  words  in  state  1).  The  SID  sufficient  condition  (5.9)  requires  that  all 
codewords  which  are  used  in  states  1 and  4 must  also  be  used  in  states  2 and  3. 
Therefore,  we  must  include  the  45  eligible  codewords  in  state  2 having 
characteristic  equal  to  zero  except  for  the  5 codewords  which  do  not  appear  in 
both  states  1 and  4. 

Next  we  may  choose  the  30  codewords  having  characteristic  equal  to  2 
because  there  is  no  conflict  with  the  SID  condition.  This  leaves  58  words  to  be 
chosen  in  order  to  complete  the  encoding  mode  for  state  1.  It  would  be 
desirable  to  use  all  of  the  codewords  having  characteristic  equal  to  -1 , 
however,  this  would  conflict  with  the  SID  requirement.  To  satisfy  the  SID 
condition,  we  begin  by  choosing  those  words  having  characteristic  equal  to  -1 
whose  complements  were  not  used  in  state  1,  of  which  there  are  seven  codewords. 
For  the  remaining  51  words,  we  must  choose  chodewords  such  that  every  codeword 
having  characteristic  equal  to  -1  appears  with  its  complement.  Therefore  we  may 
choose  25  complementary  pairs  from  the  remaining  2-eligible  words.  The  last 
codeword  must  be  chosen  from  the  remaining  eligible  words  having  characteristic 
equal  to  one.  The  final  51  codewords  are  chosen  such  that  the  number  of 
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occurrences  of  the  extreme  DS  values  is  maximum. 

The  code  obtained  from  this  design  procedure  has  an  error  performance  for 
the  DSF  equalized  channel  which  is  slightly  worse  (approximately  .3  db)  than 
that  of  the  MS43  (see  Fig.  20  in  Chapter  4).  The  probability  of  the  extreme  DS 
values  is  .0374  as  opposed  to  .0173  for  the  (7,5,4)  code  designed  in  section 
5.3.  We  see  that  for  these  parameters,  the  range  of  the  extreme  DS 
probabilities  is  not  great.  This  is  not  surprising  because  the  amount  of  choice 
available  for  the  positive  reflecting  mapping  is  not  great  (128  words  out  of  1 36 
eligible  when  d=6).  Furthermore,  the  SID  requirement  is  a further  constraint 
when  we  attempt  to  maximize  the  extreme  DS  probabilities. 
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CHAPTER  6 


DECODING  AND  ERROR  CONTROL  AT  THE  TERMINAL 

6.1  Basflding  at  its  I.ermpal 

We  concluded  Chapter  2 with  a short  discussion  the  model  for  the 
destination  terminal  of  the  repeater ed  line  (see  Fig.  8).  We  may  recall  that 
the  terminal  is  identical  to  a repeater  with  the  exception  of  a decoder  which 
converts  the  received  (estimated)  channel  sequence  into  an  estimate  of  the 
original  binary  information  sequence.  In  this  section,  we  shall  discuss  the 
structure  of  this  decoder,  and  mention  some  aspects  of  the  error  performance  of 
the  decoder  relative  to  the  error  performance  of  the  channel  symbol  sequence. 

Because  the  encoder  processes  the  original  binary  information  sequence  in 
blocks  of  m bits  and  emits  blocks  of  n symbols,  the  decoder  must  process  the 
estimated  channel  symbol  sequence  in  blocks  of  n symbols.  However,  the  decision 
device  which  produces  the  estimate  of  the  channel  sequence  is  a symbol-by-symbol 
detector,  and  thus  does  not  produce  a sequence  which  may  be  directly  decoded. 
It  is  necessary,  therefore,  for  the  decoder  to  preprocess  the  estimated  symbol 
sequence  in  order  to  provide  a sequence  of  symbol  blocks  having  length  n.  This 
preprocessing  is  called  word  synchronization  or  framing.  When  the  encoder  is 
counter-encodable,  the  channel  sequence  contains  sufficient  redundancy  for  the 
word  synchronization  to  be  accomplished  without  additional  information  (e.g., 
extra  symbols  specifically  for  synchronization).  A general  method  for  word 
synchronization,  reported  by  Preparata  and  Bellato  [11],  is  the  construction  of 
a finite  state  machine  (called  the  monitor)  which  contains  one  state  for  each 
subset  of  states  in  the  encoder,  plus  an  alarm  state.  When  the  estimated 
channel  sequence  is  produced  by  the  detector,  it  passes  through  a framing 


circuit  which  divides  the  symbol  sequence  into  blocks  of  n-tuples.  This  block 
(vector)  sequence  is  then  input  to  the  monitor.  The  monitor  is  initially  in  a 
state  which  corresponds  to  the  complete  set  of  encoder  states  (this  represents 
its  ignorance  of  the  actual  encoder  state).  As  each  n-tuple  enters  the  monitor, 
its  characteristic  (real  sum  of  the  symbols)  is  calculated  and  the  monitor 
changes  state  according  to  the  result.  Eventually,  the  monitor  reaches  a state 
corresponding  to  a single  encoder  state.  From  this  time  on,  it  mimics  the  state 
transitions  of  the  encoder  (i.e.,  there  is  a sub-automaton  imbedded  in  the 
monitor  which  is  isomorphic  to  the  encoder).  The  PST  code  (see  Table  1) 
provides  a simple  example.  The  state  diagram  of  its  monitor  is  illustrated  in 

Fig.  25.  The  sub-autanaton  is  shown  in  the  box  and  the  alarm  state  is  denoted 

* 

by  C . The  numbers  associated  with  the  state  transitions  are  thr- 
characteristics  of  the  received  codewords. 
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Suppose  that  the  monitor  is  in  its  initial  state,  {1,2},  and  it  receives  the 
block  '0+'.  It  calculates  the  characteristic,  +1,  and  changes  to  the  state 


I 


corresponding  to  the  encoder  state  2,  {2}.  The  monitor  in  this  case  has  become 
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synchronized  with  the  encoder  because  the  only  way  the  encoder  could  have 
produced  the  sequence  0+  is  if  it  had  been  in  state  1;  after  producing  the 
codeword  '0+* , its  next  state  would  have  been  state  2.  In  the  absence  of 
transmission  errors,  the  monitor  would  remain  synchronized  with  the  PST  encoder. 

If  there  are  transmission  errors,  or  if  for  any  reason  the  word 
synchronization  is  lost  (i.e.,  the  n-tuples  entering  to  monitor  do  not 
correspond  to  codewords,  but  rather  the  concatenation  of  symbols  from  two 
adjacent  codewords),  the  characteristics  of  these  n-tuples  will  eventually 
violate  the  encoder  state  transition  rule.  When  this  occurs,  the  monitor  enters 
the  alarm  state.  It  then  signals  a device  which  we  call  the  framing  strategy 
device  which  must  decide  if  the  violation  occurred  as  a result  of  a transmission 
error  or  a misframed  condition.  The  monitor  then  enters  its  initial  state  .nd 
attempts  to  become  synchronized  with  the  encoder  again.  The  framing  strategy 
device  must  also  have  the  capability  to  adjust  the  framing  circuit  when  it 
decides  that  a misframed  condition  prevails.  Fig.  26  illustrates  a block  model 
of  the  word  synchronization  system. 


Figure  26.  Word  synchronization  system 


The  strategy  employed  for  determining  the  word  synchronization  is  based  on  the 
statistics  of  the  encoder.  For  codewords  of  length  n,  there  is  one  correct 
frame  position  (phase)  and  n-1  incorrect  phases.  The  statistics  of  the  state 
transition  violations  for  each  of  the  incorrect  phases  are  used  to  determine  the 
expected  frequency  of  occurrence  of  these  violations.  The  framing  strategy  box 
in  Fig.  26  identifies  the  frame  condition  based  on  these  statistics  and 
transnits  the  appropriate  control  signal  to  the  framing  circuit  in  order  to 
obtain  word  synchronization.  A specific  framing  strategy  for  the  MS43  code, 
which  is  based  on  a simplified  encoder  state  monitor,  has  been  reported  in  [ 28 ] . 

When  the  symbol  sequence  has  been  properly  framed,  the  blocks  of  n-tuples 
may  be  converted  into  the  corresponding  blocks  of  binary  m-tuples  by  the  decoder 
when  no  transmission  errors  have  occurred.  Because  the  CE  codes  are  non-linear 
in  general,  a table  lookup  scheme  is  necessary  to  accomplish  this  conversion. 
This  is  not  a difficult  task  because  the  codes  under  consideration  are  short 
(n<7).  for  which  current  LSI  technology  is  sufficient.  The  n-tuples  are 
converted  to  addresses  in  a read-only-memory  which  contain  the  binary  m-tuples. 

In  Chapter  5,  we  introduced  the  property  of  state  independent  decodab ility 
for  a CE  code.  For  an  SID  code,  the  inverse  of  the  encoding  function  is 
independent  of  the  encoder  state;  that  is,  bu=f-1  (iiu)  where  . 

When  decoding  an  SID  code,  we  must  allocate  one  address  in  the  ROM  for  each 
distinct  codeword  (note  that  in  general  there  are  more  than  2m  distinct 
codewords).  However,  if  the  code  is  not  SID,  then  2m  addresses  must  be 
allocated  for  each  encoder  state  for  a total  of  r2m  2n-bit  addresses.  The 
memory  requirements  are  always  greater  for  the  state  dependent  decoding  in 
practical  cases.  Moreover,  when  the  decoding  is  state  dependent,  the  state 
identification  monitor  must  also  be  incorporated  into  the  decoding  operation. 
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Because  the  word  synchronizer  estimates  the  state  of  the  encoder  (after  the 
identification  transient) , the  estimate  may  be  provided  to  the  decoder  for  use 
in  state  dependent  decoding  (i.e.,  the  estimate  tells  the  decoder  which  decoding 
table  to  use) . We  will  see  later  that  this  dependence  on  the  state 
identification  monitor  can  have  detrimental  effects  on  the  error  probability, 
but  can  be  used  to  advantage  in  an  error  detection  scheme. 
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In  the  previous  chapters,  we  have  been  concerned  with  the  channel  symbol 
errors  in  the  repeater  decision  device  when  the  channel  sequence  is  disturbed  by 
Gaussian  noise  and  I. I.  is  present.  It  is  clear  that  there  is  a direct 
relationship  between  the  channel  symbol  errors  and  the  bit  errors  at  the  output 
of  the  decoder.  Because  the  original  information  sequence  was  encoded  into 
blocks  of  channel  symbols,  a single  error  in  the  channel  sequence  may  cause  as 
many  as  m bit  errors  in  the  decoded  information  sequence.  Thus,  we  may  consider 
the  bit  error  probability,  denoted  Pfa(E),  to  be  mP(E).  This  error  propagation 
as  a result  of  the  block  encoding  is  not  serious  as  long  as  the  block  length  of 
the  code  is  short  (an  order  of  magnitude  in  error  probability  corresponds  to 
approximately  1 db.  of  reduction  in  the  SNR  for  applications  we  have 
discussed).  This  approximation  of  Pb(E)  is  valid  when  we  assume  that  the 
decoder  operates  on  the  properly  framed  block  of  channel  symbols.  When  the 
channel  sequence  is  improperly  framed,  the  probability  of  a bit  error  is  very 
high  (close  to  one).  Fortunately,  there  is  enough  redundancy  in  the  channel 
sequence  to  detect  many  error  patterns.  We  discuss  the  various  error  detection 
schemes  briefly  in  the  following  section. 
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6.2  Error  Detection  In  U0£  Terminal 

We  have  mentioned  error  monitoring  in  the  repeaters  in  Chapter  4 when  the 
DSF  equalization  scheme  was  discussed.  In  addition  to  this,  we  may  detect 
errors  in  the  decoder  with  either  the  state  identification  monitor  or  the  table 
lockup  procedure  (or  both). 


J 


Error  detection  in  the  state  identification  monitor  begins  after  the 
monitor  becomes  synchronized  with  the  encoder.  The  error  patterns  which  may  be 
identified  are  a function  of  the  state  of  the  monitor/ encoder . The  condition 
for  detection  is  that  the  error  pattern  must  change  the  characteristic  of  the 
n- tuple  into  a value  which  does  not  occur  in  the  encoder  when  the  encoder  is  in 
its  present  state.  For  example,  in  the  simplest  case  of  the  PST  code,  if  the 
encoder  and  monitor  are  in  state  1,  the  error  pattern  must  produce  a pair  of 
symbols  which  has  characteristic  equal  to  -2,-1,  or  2 in  order  to  be  detected. 
The  effectiveness  of  this  error  detection  scheme  is  related  to  the  number  of 
state  transitions  which  may  occur  from  each  encoder  state.  If  an  encoder  has  a 
minimal  number  of  states  and  the  codewords  are  chosen  in  order  to  minimize  the 
number  of  states  to  which  it  may  change,  the  error  detection  will  be  most 
effective.  We  would  expect,  therefore,  that  an  encoder  having  only  one  state 
(in  which  all  words  have  characteristic  equal  to  zero,  of  course),  would  be  the 
most  effective  because  all  errors  which  change  the  characteristic  of  the 
received  n-tuple  from  zero  are  detectable. 

A disadvantage  of  this  error  detection  scheme  may  be  recognized  from  the 
action  of  the  state  identification  monitor  after  it  detects  a violation.  We  may 
recall  that  the  monitor  enters  the  alarm  state,  and  signals  the  framing  strategy 
device.  It  then  enters  its  initial  state  and  begins  the  state  identification 
operation.  The  error  detection  capability  of  the  monitor  is  reduced  until  it 
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may  provide  error  detection  during  the  state  identification  phase  of  its 
operation.  A detailed  quantitative  analysis  of  the  probabilities  involved  has 
not  been  reported. 
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Error  detection  schemes  implemented  in  the  table  lookup  procedure  are 
possible  for  CE  codes  because  in  general  the  complete  set  of  n-tuples  do  not 
appear  in  the  codebook.  The  error  detection  is  accomplished  by  allocating  one 
address  in  the  ROM  for  each  of  the  3n  n-tuples.  The  contents  of  those  addresses 
which  can  not  be  generated  by  the  encoder  outputs  contain  the  error  message. 
The  effectiveness  of  this  scheme  is  dependent  on  whether  state  dependent  or 
state  independent  decoding  is  performed.  When  the  decoding  is  state 
independent,  an  error  is  detected  whenever  the  received  n-tuple  does  not  appear 
anywhere  in  the  codebook.  The  number  of  received  n-tuples  which  will  generate 
the  error  message  is  3n-N,  where  N is  the  number  of  distinct  words  in  the 
codebook  and  is  larger  than  2m  in  general.  For  example,  the  MS43  codebook 
contains  every  3-tuple  except  '000'  , and  therefore,  the  SID  error  detection 
scheme  would  not  be  very  effective.  If  the  decoding  is  state  dependent,  then 
there  are  3n-2m  n-tuples  which  would  indicate  an  error,  providing  greater  error 
detection  capability.  Of  course,  more  memory  is  required  in  the  state  dependent 
decoding  table.  For  a given  code,  the  effectiveness  of  error  detection  in  the 
monitor  versus  that  in  the  table  lookup  would  need  to  be  analyzed  to  determine 
if  the  greater  complexity  in  the  table  lookup  is  warranted.  We  also  mention 
that  a combination  of  error  detection  in  the  monitor  and  in  the  table  lookup 
procedure  may  be  more  efficient.  The  memory  space  necessary  in  the  table  would 
be  reduced  to  the  number  of  n-tuples  which  have  the  same  characteristic  as  the 
codewords  in  each  encoder  state. 
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We  must  emphasize  that  even  If  state  dependent  decoding  is  used,  it  is  most 
desirable  that  the  code  be  SID.  If  a code  is  not  SID,  correct  decoding  is 
dependent  of  the  operation  of  the  monitor.  Whenever  an  error  is  detected  by  the 
monitor,  the  decoding  process  is  interrupted  until  the  monitor  regains 
synchronization  with  the  encoder.  Thus  an  error  which  is  confined  to  a single 
n- tuple  will  disable  the  decoder  for  an  indefinite  length  of  time.  We  see  that 
although  state  dependent  decoding  may  be  desirable  from  the  standpoint  of  error 
detection,  it  is  detrimental  to  the  decoding  process  if  the  code  is  not  SID. 

In  this  section  we  have  presented  a general  discussion  of  error  detection 
methods  for  CE  codes.  In  a given  situation,  the  tradeoffs  involved  would  need 
to  be  considered,  and  a more  detailed  quantitative  analysis  performed  in  order 
to  determine  the  best  method.  In  the  next  section,  we  will  take  these  error 
control  considerations  one  step  further,  and  consider  single  error  correcting  CE 
codes. 


6.3  lUuatratlQD  al  Single  Error  Correcting  £E  Codes 

The  correction  of  errors  in  the  encoder/decoder  pair  with  CE  codes  is 
possible  without  a great  reduction  in  the  transmission  rate  of  the  channel 
symbols.  In  this  section,  we  will  apply  a variation  of  a known  construction  for 
nonlinear  single  error  correcting  codes  to  demonstrate  the  existence  of  CE  codes 
which  correct  single  adjacent  errors. 

We  define  a single  adlacent  error  in  a codeword  containing  n ternary 

symbols  as  the  vector  £=(e1t  e2 en)  where  ej=±1  for  some  j=1,...,n,  and  e^O 

for  i$J.  Vhen  a single  adjacent  error  occurs  in  the  jth  symbol  of  a codeword  a, 
the  received  symbol  which  has  been  altered  must  still  equal  -1,0,  or  1.  The 
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definition  of  the  single  error  is  identical  to  the  definition  of  a single  error 
given  in  section  4.2  with  the  additional  constraint  that  only  one  such  error 
occurs  within  a block  of  n symbols  corresponding  to  a codeword.  Note  that  this 
definition  is  weaker  than  the  definition  of  symmetric  errors  usually  encountered 
in  the  coding  literature.  In  Fig.  27.  we  illustrate  the  bipartite  graphs  which 
characterize  the  ternary  symmetric  channel,  and  the  adjacent  error  channel. 


(*)  (b) 

Figure  27.  (a)  Ternary  symmetric  channel,  (b)  Adjacent  error  channel 


The  errors  we  are  discussing  are  not  symmetric  because  each  ternary  symbol  is 
not  subject  to  all  possible  errors;  the  zero  symbol  may  be  altered  by  ±1, 
however,  the  nonzero  symbols,  ±1,  may  only  be  altered  by  ?1;  respectively. 
First,  we  describe  a method  of  constructing  nonlinear  ternary  codes  which  are 
capable  of  correcting  single  adjacent  errors[29].  It  is  similar  to  a 
construction  given  in  [30]  for  codes  which  are  capable  of  correcting  asymmetric 
errors  (a  weaker  property  than  adjacent  errors).  We  define  the  code  as  the  set 
of  ternary  n-tuples,  a=(a1f  a2,...,an),  such  that 


n 

Y iat  = b mod(2n+1 ) . 
i-1 


(6.1) 
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where  b is  a constant  with  absolute  value  less  than  n,  and  a^e  {-1,0,1}.  If  we 
asstme  that  a single  error  occurs  in  the  ith  digit  when  the  codeword  a is 
transmitted,  the  received  n-tuple  is 

X*  (a, *1-1 '*1*1  , »ai+1 » • • • »an^ * (6.2) 

We  may  correct  this  error  by  computing  the  quantity  ("syndrome") 

n 

t-b  ♦ l ir^]  mod(2n+1 ) s ±i  (6.3) 

i»< 

and  noting  that  the  absolute  value  of  (6.3)  is  the  location  of  the  error  and  the 
sign  is  the  sign  of  the  error.  The  error  is  corrected,  therefore,  by 
subtracting  -1  from  the  1th  digit  in  the  received  n-tuple.  All  single  adjacent 
errors  may  be  corrected  with  this  procedure.  This  may  be  demonstrated  by 
considering  the  decoding  operation  on  two  received  n-tuples;  one  with  an  error 
in  position  i,  and  the  other  in  position  J.  The  two  syndromes,  (6.3).  will  be 
identical  iff  i-J  mod(2n+1 ) in  the  case  where  the  errors  have  the  same  sign, 
and  iff  |i|+|j|=  0 mod(2n+1)  in  the  case  where  the  errors  have  opposite  sign. 
The  former  case  is  true  only  when  i=J  because  the  magnitudes  of  both  quantities 
are  less  than  n,  and  the  latter  case  is  never  true  because  the  sum  of  the 
magnitudes  is  always  less  than  2n+1 . 

It  is  interesting  to  note  that  a lower  bound  on  the  number  of  codewords  for 
a cede  constructed  by  this  procedure  is  the  same  as  the  Hamming  upper  bound  for 
ternary  single  symmetric  error  correcting  codes.  If 
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l iaj^  mod(2n+1) 
i-1 
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is  computed  for  each  of  the  3n  ternary  n-tuples,  and  the  n-tuples  grouped 
together  according  to  the  result,  the  pigeon-hole  principle  assures  us  that 
there  will  be  at  least  3n/(2n+1 ) n-tuples  which  have  the  same  value  of  b,  for 
some  b. 


We  will  use  this  construction  to  show  the  existence  of  a few  short  CE  codes 
which  are  capable  of  correcting  a single  adjacent  error  while  not  requiring  a 
great  penalty  in  the  symbol  transmission  rate,  m/n.  A computer  program  has 
calculated  the  number  of  codewords  which  may  be  used  in  such  a code  for 
blocklengths  up  to  seven.  The  set  of  possible  n-tuples  having  non-negative 
characteristics  was  first  partitioned  into  (n+1)  sets,  with  each  set 
corresponding  to  a distinct  characteristic  1,...,n,  and  a set  for  the  words 
having  characteristics  equal  to  zero.  The  number  of  words  having  negative 
characteristics  which  have  a value  b in  (6.1)  is  equal  to  the  number  of  words 
with  the  corresponding  positive  characteristics  having  the  value  -b. 

Recall  from  Chapter  5 that  we  have  constrained  the  design  of  CE  codes  to 
the  class  of  balanced  and  state  independent  decodable  codes.  A necessary  and 
sufficient  condition  for  state  independent  decoding  of  the  single  error 
correcting  codes  is 


n n 

a 

l ia±  mod(2n+1)=  £ iaj^  mod(2n+1)  (6.4) 

i-1  i=1 


for  two  codewords,  .a  and  £,  being  elements  of  distinct  states.  Moreover,  we 
know  that  in  a balanced  code,  if  codeword  .a  appears,  then  its  complement,  S', 
also  appears  somewhere  in  the  codebook.  In  order  to  satisfy  expression  (6.4),  a 
balanced  code  must  contain  only  codewords  for  which  expression  (6.1)  equals  zero 
(b=0).  Table  8 contains  the  number  of  codewords  for  each  characteristic  which 


I 

» 


i 


110 


* 

f 

' 

have  this  property  for  blocklengths  equal  to  5,  6,  and  7. 

' 

Table  8.  Number  of  available  codewords  for  SEC  balanced  codes 


LENGTH 

0 

1 

CHARACTERISTIC 

2 3 4 

5 

6 

7 

5 

6 

4 

1 

2 

1 

0 

- 

- 

6 

14 

9 

4 

5 

1 

0 

0 

- 

7 

32 

23 

13 

11 

8 

1 

0 

0 

From  Table  8,  we  find  that  the  best  length-5  code  is  a (3.5,2)  code  for 
which  there  are  two  extra  words  available  for  the  positive  reflecting  mapping. 
This  code  is  guaranteed  to  be  SID  because  all  2-state  codes  satisfy  the 
sufficient  condition  (5.9).  The  highest  rate  code  for  n=6  is  a (5,6,4)  code  for 
which  there  are  just  32  codewords  available  for  the  positive  reflecting  mapping. 
An  SID  code  may  be  constructed  by  using  all  codewords  having  characteristics 
equal  to  zero  and  one  in  the  mapping  for  state  2.  Finally,  for  n=7,  the  highest 
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SUMMARY  AND  CONCLUSIONS 
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In  this  thesis,  we  have  investigated  the  relationship  between  the  structure 
of  counter-encodable  codes  and  the  symbol  error  probability  for  a fixed 
threshold  detector  on  a linear  dispersive  additive  white  Gaussian  noise  channel. 
We  have  shown  that  the  encoder  state  process  may  be  utilized  to  summarize  the 
past  sequence  of  channel  codewords  for  the  purpose  of  determinng  the  probabilty 
distribution  of  the  I. I.  We  then  specialized  this  idea  to  the  DS  process 
associated  with  the  channel  symbol  process.  Because  the  DS  assumes  values  in  a 
finite  set,  we  were  able  to  relax  the  usual  restriction  of  truncating  the 
channels'  time  response  in  order  to  calculate  the  error  probability  via  the 
Gram-Charlier  series.  We  then  found  a set  of  functional  equations  for  the 
conditional  characteristic  functions  of  the  I. I.  for  a first  order  error 
channel.  This  allowed  us  to  calculate  the  moments  of  the  I. I.  by  recursively 
solving  systems  of  linear  equations,  whereby  the  error  probability  was 
calculated  by  utilizing  the  conditional  moments.  The  effects  of  the  distortion 
coefficient,  c,  and  the  rate  of  decay,  X , on  the  error  probability  for  the 
first  order  error  channel  were  illustrated  for  several  known  CE  codes..  We 
found  that  different  codes  exhibit  significantly  different  error  performance. 
The  largest  difference  in  performance  occurred  for  channels  with  a slow  decay 
rate.  We  also  illustrated  that  the  effect  of  this  decay  rate  on  the  performance 
of  a single  code  is  significant  with  the  worst  error  performance  occurring  fcr 
the  slowest  rate  of  decay  ( X=  1). 
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These  results  led  us  to  investigate  a simple  channel  model,  the  DS  channel, 
which  was  found  to  produce  accurate  results  for  channels  with  a slow  response. 
The  major  result  arising  from  the  DS  channel  model  is  the  multimodal  probability 
density  function  of  the  I. I.  We  found  one  mode  for  each  value  of  the  DS  with 
each  mode  being  equal  to  the  DS  value  scaled  by  the  channel  distortion 
coefficient,  c.  Furthermore,  we  found  that  when  X<  1,  the  assumption  that  each 
mode  is  Gaussian  with  variance  equal  to  the  conditional  I. I.  variance  produced 
extrenely  accurate  error  probability  results. 

Another  result  arising  from  the  DS  channel  model  is  the  design  of  a simple 
decision  feedback  equalization  scheme  which  keeps  track  of  the  DS  of  the 
dectected  channel  sequence  and  compensates  for  the  strong  bias  in  the  I. I.  We 
found  that  the  deleterious  effects  of  the  I. I.  on  the  error  probability  were 
effectively  reduced  with  this  scheme.  This  scheme  is  effective  because  it  is 
based  directly  on  the  statistics  of  the  I. I.  In  addition,  we  discussed  a method 
of  controlling  the  error  propagation  which  is  inherent  to  all  decision  feedback 
schemes. 

The  relatively  simple  expression  for  the  error  probability  on  the  DS 
channel  provides  a convenient  method  of  comparing  the  error  performance  of 
different  codes.  We  found  that  the  number  of  DS  values,  the  DS  transition 
probabilities,  and  the  steady  state  DS  probabilities  should  be  considered  before 
judging  code  performance.  This  is  a more  accurate  technique  than  considering 
only  the  number  of  DS  values  which  is  current  practice.  All  of  these 
considerations  led  to  a procedure  for  constructing  CE  codes  which  minimize  the 
error  probability  on  the  DS  channel.  We  have  shown  an  example  of  this 
construction  for  a (7,5,4)  CE  code  which  exhibits  better  error  performance  than 
the  known  MS43,  yet  also  provides  a greater  rate  increase.  We  also  illustrated 


a coding  procedure  for  the  DSF  equalization  scheme.  The  objective  for  these 
constructions  is  opposite  that  for  the  fixed  threshold  detector  case;  however, 
the  construction  procedure  is  similar. 


In  the  last  chapter,  we  discussed  some  problems  associated  with  decoding  CE 
codes.  In  particular,  a general  structure  for  word  framing  was  presented.  We 
then  discussed  techniques  for  error  detection  at  the  terminal.  Finally,  we 
demonstrated  the  existence  of  single  error  correcting  CE  codes  via  a known 
construction  of  nonlinear  ternary  error  correcting  codes. 
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